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attB 1 : 5 '- AC AAGTTTGT ACAAAAAAGC AGGCT-3 ' 



attB2: 5'-ACCCAGCTTTCTTGTACAAAGTGGT-3' 



attPl : 5'-TACAGGTCACTAATACCATCTAAGTAGTTGATTCATAGTGACTGGATATG- 
TTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTA- 
ATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTAC- 
AAAGTTGGCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACGAACA- 
GGTCACTATCAGTCAAAATAAAATCATTATTTG-3' 



attP2 : 5 '-CAAATAATGATTTT ATTTTGACTGAT AGTGACCTGTTCGTTGC AACAAAT- 
TGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGAAC- 
GAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTTGCAT- 
AAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGA- 
ATCAACTACTTAGATGGTATTAGTGACCTGTA-3' 



attRl : 5 '- ACAAGTTTGT AC AAAAAAGCTGAACG AGAAACGT AAAATGATAT AAA- 
TATCAATATATTAAATTAGATTTTGCATAAAAAACAGACTACATAATAC- 
TGT AAAAC AC AACAT ATCCAGTC ACT ATG-3 1 



attPv2 : 5'-GCAGGTCGACCATAGTGACTGGATATGTTGTGTTTTACAGTATTAT- 
GTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTT- 
AT ATC ATTTT ACGTTTCTCGTTC AGCTTTCTTGT ACAAAGTGGT-3 ' 



attLl : 5'-CAAAT AATGATTTT ATTTTGACTGAT AGTGACCTGTTCGTTGC AAC- 
AAATTGATAAGCAATGCTTTTTT AT AATGCCAACTTTGT ACAAAAAA- 
GC AGGCT-3 1 



attL2 : 5-CAAATAATGATTTTATTTTGACTGAT AGTGACCTGTTCGTTGC AACAA- 
ATTGATAAGCAATGCTTTCTTATAATGCCAACTTTGTACAAGAAAGCTGGGT-3 1 
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Figure /G#: Cloning sites of ihe onty Vector p E^-l & (reading 

frame A) 



Or a I Xmn I Sail SamH I Kpn I C coR I 

i 1 i 1 1 1 1 



ACT TTG TAC AAA AAA GCA GGC TTT I AAA GGA ACCIAAT TCA G fTC GA C TGp ATC CGGTACJ CGl*_ATT C 
TgT He ATG TTT TTT CGT CCG AAA ItTT CCT TGgItTA AGT CAG-CTiG ACCTAGI GCFaTG GCT^AAl G 
llr leu tyr lys lys ala gly phe lys gly thr asn ser val asp trp ile arg tyr arg He 



fcoR I 



Not I 



Xho I EcoR V 



"i r 



ccdB gene 



-GIAAT TCG CGG CCG CAC ITCGAGA T)AT CTA GAC CCA GCT TTC TTG TAC AAA 
C TTAAfeC GcHgt GTG AGT11CT ATT A GAT CTG GGT CGA AAG AAC ATG TTT 



r E 
car 



r 51 



Psp^ 406 I 2634 
BsrB I 2623 
B£?s I 2553 
AatW 2488 



Ora I 166 



Nhe I 2470 
Nsp I 2420 
BspLU11 I 2420 
Aff III 2420 

DrcM 2312 

BssS I 2247 

Had II 2176 



AlvAt I 2O06 - 



Eco57 I 1893 




multiple 
T- cloning 
- 1 site 



Kpn I 197 

BssH II 243 

Bsfl 107 I 282 
$rf\ 401 
Sma I 402 
Mam I 413 

Bs« I 514 

Bsa i 536 
Not I 632 

EcoR V 646 



Pvu II 760 
Psf I 763 



multiple 

cloning 

site 



Rca I 826 



Nrv I 957 



PflM I 1559 

v*sp 



Esp3 



EcoH I 1210 
Sspl 1225 
Sgf\ 1298 



Pvu I 1299 



pENTRIA 2717 bp 

Base Hos . Gene Encoded 



67 . 


. 166 


attLl 


321 . 


. 626 


ccdB 


655 . 


. 754 


attL2 


877 . 


. 1686 


KmR 


1791 


. .2364 


ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTAA AGGAACCAAT 
181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG C C AG AT AAC A GTATGCGTAT 
241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA 
3 01 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC CTATAAAAGA GAGAGCCGTT 
3 61 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATAGTGA 
421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG 
481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT 
541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA AAATGACATC AAAAACGCCA 
6 01 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACCCAG 
661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC 
721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG 
781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TATCATCATG AACAATAAAA 
841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG 
901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC 
961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA 
1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TG AG ATGGT C 
10 81 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT 
1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA 
12 01 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTC CCTGCGCCGG 

12 61 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 

13 21 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT 
13 81 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG 
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA 

15 01 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC 
1561 AT C C TATGG A ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 

16 21 TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT 
16 81 TTCTAATCAG AATTGGTTAA TTGGTTGTAA C ATT ATT C AG ATTGGGCCCC GTTCCACTGA 
1741 GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA 
18 01 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA 
18 61 GAG C T AC C AA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT 
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA 
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT 
2 041 ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG 
2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG AT AC C T AC AG 

'2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATC CGGTA 
2 221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT 
22 81 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG 
2 3 41 TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC 
24 01 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC 
24 61 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAGCGAGAG TAGGGAACTG 
2 521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT 
2 581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG 
2 641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA 
2 701 CTAAGCAGAA GGCCATC 



Figure lift: Cloning Sites of the Entry Vector pENTR2B (reading frame*) 



Int attLl 



Ehe i Xronl Sail BamHI 

. -t,„ _L_ „„ h , r Ap* TTC AGbcGA C.TG I GAT CC G 

T ^G TAC AAA AAA GCA GGC jgG O* CGG AAC CAA TTC GCT~1gAC CTAGbc 

AAC ATG TTT| TTT CGT CCG Ab C GLG GCC TTG Gjl ^ ^ ^ 

L eu Tvr Lys Lys Ala Gly Trp ^ Arg Asn Gin Phe Ser Arg Leu Asp Pro 



Kpnl EcoRI 



EcoRI 



NotI 



CAT GGC TTA 



Val Pro Asn 



C TTA 



"^GC GCC GG|C 



Xhol EcoRV Xbal 
\c Itcg AG A 

rG AGC TCT 



1CTA G 


AC CCA 
TO GGT 


GAT C 

N 


V _ 



Int attL2 

GCT TT |C TTG TAC AAA G 
CGA AAG AAC ATGlTTT C 



Ala Phe Leu Tyr Lys 



pENTR2B 2718 bp 

Location (Base Nos . ) . Gene Encoded 

67 . . 166 attLl 

322 . . 627 ccdB 

656 . . 755 attL2 

878 . . 1687 KmR 

1792 . . 2365 ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGGCG CCGGAACCAA 
181 TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA GCCAGATAAC AGTATGCGTA 
241 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC 
3 01 AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA CCTATAAAAG AGAGAGCCGT 
361 TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG ACGGATGGTG 
421 ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT CCCGTGAACT TTACCCGGTG 
481 GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG TGTGCCGGTC 
541 TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT CAAAAACGCC 
6 01 ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC ACTCGAGATA TCTAGACCCA 
661 GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT TATCAATTTG TTGCAACGAA 
721 CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC CAGCTGCAGC TCTGGCCCGT 
781 GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT ATATCATCAT GAACAATAAA 
841 ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG AG C CAT ATT C AACGGGAAAC 
901 GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA TATGGGTATA AATGGGCTCG 
961 CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG TATGGGAAGC CCGATGCGCC 
1021 AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT GATGTTACAG ATGAGATGGT 
10 81 CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC ATCAAGCATT TTATCCGTAC 
1141 TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA AAAACAGCAT TCCAGGTATT 
12 01 AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG CTGGCAGTGT TCCTGCGCCG 

12 61 GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC GATCGCGTAT TTCGTCTCGC 

13 21 TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG AGTGATTTTG ATGACGAGCG 
13 81 TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT AAACTTTTGC CATTCTCACC 
1441 GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC CTTATTTTTG ACGAGGGGAA 

15 01 ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA G AC CGAT AC C AGGATCTTGC 
1561 CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA CAGAAACGGC TTTTTCAAAA 

16 21 ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT CATTTGATGC TCGATGAGTT 
16 81 TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA GATTGGGCCC CGTTCCACTG 
1741 AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT 
18 01 AATCTGCTGC TTGCAAACAA AAAAAC C AC C GCTACCAGCG GTGGTTTGTT TGCCGGATCA 
1861 AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
1921 TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC 
1981 ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 
2 041 TACCGGGTTG GAC TCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG 

z 2101 GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA 
'2161 GCGTGAGCTA TGAGAAAGCG CCACGCTTCC C G AAGGG AG A AAGGCGGACA GGTATCCGGT 
2221 AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 
22 81 TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATG CTC 
2 341- GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC 
2 4 01 CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA 
24 61 CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA CTAAGCGAGA GTAGGGAACT 
2 521 GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT GGGCCTTTCG TTTTATCTGT 
2 581 TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC CGGGAGCGGA TTTGAACGTT 
2 641 GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAACTGC CAGGCATCAA 
2 701 ACTAAGCAGA AGGCCATC 
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Figure pAl Cloning Sites of the Entry Vector pENTR3C (reading frame C) 



Int 



attLl 



Dral 



XmnI 



Sail 



BamHI 



Tl fe TAC AAA ^ AAA GCA GGC l{CT TTA AAG GAA CCA ATT CAg |tCG AC T g Ba TCC GGT 



AAC ATG TTT 



TTT CGT CCG ApA AAfT TTC CTT GGT TAA GTC AGC TpA CCT A' 



Leu Tyr Lys Lys Ala Gly Ser Leu Lys Glu Pro lie Gin Ser Thr Gly Ser Gly 



Kpnl EcoRI 



Pvul 



EcoRI 



Acjc g| aA TT C GAT cb c — ccdB 
TGG CTT A^G^jCTA GCG 

Thr Glu Phe 



- -G [AATTCG 

C TTA A)3C GCC G 



NotI 

cJgg 



Xhol 



EcoRV Xbal 



Asn 



V 

Ser 



CCG CAclTCG AGA ijAT I CTA 

T ATA 

V 



GAT 



^2 GTG AGC TjC 
Arg Pro His Ser Arg Tyr Leu 



attL2 



Int 



GAC CCA GCT TTC TTG TAC AAA G 



C TG GGT CGA AAG AAC ATG TTT C 
Asp Pro Ala Phe Leu Tyr Lys 



pENTR3C 2723 bp 

Location (Base Nos . ) Gene Encoded 

67 . . 166 attLl 

327 . . 632 CCdB 

661 . . 760 attL2 

883 . . 1692 KmR 

1797 . . 2370 ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTCTTT AAAGG AAC C A 
181 ATTCAGTCGA CTGGATCCGG TACCGAATTC GAT CG C TT AC TAAAAGCCAG ATAACAGTAT 
241 GCGTATTTGC GCGCTGATTT TTGCGGTATA AGAATATATA CTGATATGTA TACCCGAAGT 

3 01 ATGTCAAAAA GAGGTGTGCT TCTAGAATGC AGTTTAAGGT TT AC AC CTAT AAAAGAGAGA 
361 GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA 
421 TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC 

4 81 CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC 
541 CGGTCTCCGT TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA 
6 01 ACGCCATTAA CCTGATGTTC TGGGGAATAT AGAATTCGCG GCCGCACTCG AGATATCTAG 
661 AC C CAGCTTT CTTGTACAAA GTTGGCATTA TAAGAAAGCA TTGCTTATCA ATTTGTTGCA 
721 ACGAACAGGT CACTATCAGT CAAAATAAAA TCATTATTTG CCATCCAGCT GCAGCTCTGG 
781 CCCGTGTCTC AAAATCTCTG ATGTTACATT G C AC AAG AT A AAAAT AT AT C ATCATGAACA 
841 ATAAAACTGT CTGCTTACAT AAACAGTAAT ACAAGGGGTG TTATGAGCCA TATTCAACGG 
9 01 GAAACGTCGA GGCCGCGATT AAATTCCAAC ATGGATGCTG ATTTATATGG GTATAAATGG 
961 GCTCGCGATA ATGTCGGGCA ATCAGGTGCG ACAATCTATC GCTTGTATGG GAAGCCCGAT 

1021 GCGCCAGAGT TGTTTCTGAA ACATGGCAAA GGTAGCGTTG CCAATGATGT TACAGATGAG 
10 81 ATGGTCAGAC TAAACTGGCT GACGGAATTT ATGCCTCTTC CG AC CAT C AA GCATTTTATC 
1141 CGTACTCCTG ATGATGCATG GTTACTCACC ACTGCGATCC CCGGAAAAAC AGCATTCCAG 
12 01 GTATTAGAAG AATATCCTGA TTCAGGTGAA AATATTGTTG ATGCGCTGGC AGTGTTCCTG 

12 61 CGCCGGTTGC ATTCGATTCC TGTTTGTAAT TGTCCTTTTA ACAGCGATCG CGTATTTCGT 

13 21 CTCGCTCAGG CGCAATCACG AATGAATAAC GGTTTGGTTG ATGCGAGTGA TTTTGATGAC 
13 81 GAGCGTAATG GCTGGCCTGT TGAACAAGTC TGGAAAGAAA TGCATAAACT TTTGCCATTC 
1441 TCACCGGATT CAGTCGTCAC TCATGGTGAT TTCTCACTTG ATAACCTTAT TTTTGACGAG 
1501 GGGAAATTAA TAGGTTGTAT TGATGTTGGA CGAGTCGGAA TCGCAGACCG ATACCAGGAT 
1561 CTTGCCATCC TATGGAACTG CCTCGGTGAG TTTTCTCCTT CATTACAGAA ACGGCTTTTT 
16 21 CAAAAATATG GTATTGATAA TCCTGATATG AATAAATTGC AGTTTCATTT GATGCTCGAT 
16 81 GAGTTTTTCT AATCAGAATT GGTTAATTGG TTGTAACATT ATTCAGATTG GGCCCCGTTC 
1741 CACTGAGCGT CAGACCCCGT AG AAAAG AT C AAAGGATCTT CTTGAGATCC TTTTTTTCTG 
18 01 CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG 
18 61 GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC G C AG AT AC C A 
1921 AATAC TGTTC TTCTAGTGTA GCCGTAGTTA GGCCACCACT T C AAG AAC T C TGTAGCACCG 
1981 CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG 
2 041 TGTCTTACCG GGTTGGACTC AAG AC GAT AG TTACCGGATA AGGCGCAGCG GTCGGGCTGA 
,2101 ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTAC&CCGA AC TG AG AT AC 

2161 CTACAGCGTG AGC TATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT 
2 221 CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC 
22 81 TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA 
2 34V TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC 
24 01 CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG 
2461 GATAACCGTA TTACCGCTAG CATGGATCTC GGGGACGTCT AAC TACT AAG CGAGAGTAGG 
2 521 GAACTGCCAG G CAT C AAAT A AAACGAAAGG CTCAGTCGGA AGACTGGGCC TTTCGTTTTA 
2 581 TCTGTTGTTT GTCGGTGAAC GCTCTCCTGA G T AGGAC AAA TCCGCCGGGA GCGGATTTGA 
2641 ACGTTGTGAA GCAACGGCCC GGAGGGTGGC GGGCAGGACG CCCGCCATAA ACTGCCAGGC 
2 7 01 ATCAAACTAA GCAGAAGGCC ATC 





Figure 13^- Cloning Sites of the Entry Vector pENTR4 



Int attLl 



Ncol 



Kozak XmnI 



Sail 



BamHI 



Ti b TAG AAA AAA GCA GGC TZC Ac fc ATG GGA ACC AAT TCA g }tC GAG TG b ATC CGG 
AAC ATG TTTlTTT CGT CCG A 3G TGG TACj^CCT TGG^TTA AGT CAG CTj3 ACC TAG|GC^T 

Leu Tyr Lys Lys Ala Gly Ser Thr Met Gly Thr Asn Ser Val Asp Trp lie Arg 



Kpnl EcoRI 

TAcl c dA ATT _ C-- 
ATG GCT TAA|G 

Tyr Arg lie 



EcoRI 



Not I 



Xhol 



EcoRV Xbal 



ccdB 



C TTA 



^GC GCC GGjC 



GTG AGC 



v 



|CTA G 


AC CCA GCT 
VG GGT CGA 


GAT C 

n| 


V 



Asn Ser Arg Pro His Ser Arg Tyr Leu Asp Pro Ala 



Int 



attL2 



T TC TTG TAC AAA G 
AAG AAC ATG ITTT C 



Phe Leu Tyr Lys 



pENTR4 2720 bp 

lotion (B^s HQS.) Gene Encoded 

67. .166 attLl 

324.. 629 = cdB 

658 . .757 attL.2 

880 . .1689 KmR 

1794 . .2367 °ri 

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
i2i aagcaatSS ^Sataat GCCAACTTTG TACAAAAAAG CAGGCTCCAC catgggaacc 
III a^c1g?cg actggatccg gtaccgaatt cgcttactaa aagccagata acagtatgcg 
III ™Stgcgcg ctgatttttg cggtataaga atatatactg atatgtatac ccgaagtatg 
III SaaaaagS SSgcttct agaatgcagt ttaaggttta cacctataaa agagagagcc 
lei gSa^SSS gtttgtggat gtacagagtg atattattga cacgcccggg cgacggatgg 
III ?ga^cccc? ggccagtgca cgtctgctgt cagataaagt ctcccgtgaa ctttacccgg 
III SSScatS cggggatgaa agctggcgca tgatgaccac cgatatggcc agtgtgccgg 
til S?ccgJtat cggggaagaa gtggctgatc tcagccaccg cgaaaatgac atcaaaaacg 
6m ccStaacct gatgttctgg ggaatataga attcgcggcc gcactcgaga tatctagacc 
lei cSS^St gScaaagtt ggcattataa gaaagcattg cttatcaatt tgttgcaacg 
ill aaJaggtcac ?Scagtcaa aataaaatca ttatttgcca tccagctgca gctctggccc 

ill SScTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA 

III aaactgtSS Stacataaa cagtaataca aggggtgtta TGAGCCATAT TCAACGGGAA 

901 A^CGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT 

96^ ScgaSStg SSgcaatc aggtgcgaca atctatcgct tgtatgggaa gcccgatgcg 
illl ccag™g? ttctgaaaca tggcaaaggt agcgttgcca atgatgttac agatgagatg 

loll GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA CCATCAAGCA TTTTATCCGT 

llll acSSgg^ atgcatggtt actcaccact gcgatccccg gaaaaacagc ATTCCAGGTA 
llll Jtagaagaa? atcctgattc aggtgaaaat ATTGTTGATG cgctggcagt gttcctgcgc 
llll cgSSSSSt JSStcctgt ttgtaattgt ccttttaaca gcgatcgcgt atttcgtctc 
llll ScIggSc SScacgaat gaataacggt ttggttgatg cgagtgattt tgatgacgag 
llll cSSSgct ggcctgttga acaagtctgg aaagaaatgc ataaactttt gccattctca 
llll ccggItSS ?cgtcactca tggtgatttc tcacttgata accttatttt tgacgagggg 
llll aSSaatag gttgtattga tgttggacga gtcggaatcg cagaccgata ccaggatctt 
llll SJSSSm ggaactgcct cggtgagttt tctccttcat tacagaaacg gctttttcaa 

IWl AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG 

llll mn S cIgaattggt taattggttg taacattatt cagattgggc cccgttccac 
All SgcgSSg Scccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc 
i8oi Saa^ctgc? ScSScaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat 
llll caSIgctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat 
llll aotgScSc Sgtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct 

llll ScIScScG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
2041 SSScGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGG C TG AACG 

Hoi ggSgttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta 
' 'llll cagStgagc Stgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg 

2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG 
2281 ?atcStata GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT tttgtgatgc 

H A SgtcLggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg 
2401 gSStgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat 
mi aaccgtatta ccgctagcat ggatctcggg gacgtctaac tactaagcga gagtagggaa 
Isll Sgccaggca tcaaataaaa cgaaaggctc agtcggaaga ctgggccttt cgttttatct 
llll ZZtttotc ggtgaacgct ctcctgagta ggacaaatcc gccgggagcg gatttgaacg 
2641 TTGTGAAGCA acggcccgga gggtggcggg caggacgccc gccataaact gccaggcatc 

27 01 AAACTAAGCA G AAGGC CATC 



9 



Figure / ^Cloning sites of the tr*+y Vector p bM^ 



■-^ g tac aaa aaa gca ggc tfc t c^tjtg waJB ut tea gfes. 
^ >2^5ti^^t: cot ccg «U gtT^c cct tggjtta agt cag 
k ^ ^ * e ^5 -ft, /W. *r >U( 



gac tg fr ate 
"ctjg acc tag 



cq p tacl cg W att cge - 
|gcjc atg get taa^gcg - 

Arg Tlj<- Arj lie 



- Daatli --- agj^att cge 
(ccdB) tct taajgeg. 



frge 



cge 

:g 



acjt cga 



gat at (e tag la 
cta^tag atcyt 



ccg c^g tgagct^ 



ice cag ci 

gg gtc aaa aaa aca 



• 



pENTRS 2720 bp 



Location (Base Nos . ) 



Gene Encoded 



67 . . 166 
324 . .629 
658 . . 757 
880 . . 1689 



attLl 

ccdB 

attL.2 



KmR 



1794. .2367 



orx 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTCA TATGGGAACC 
181 AATTCAGTCG ACTGGATCCG GTACCGAATT CGCTTACTAA AAG C C AG AT A ACAGTATGCG 
241 TATTTGCGCG CTGATTTTTG CGGTATAAGA ATATATACTG ATATGTATAC CCGAAGTATG 
3 01 TCAAAAAGAG GTGTGCTTCT AGAATGCAGT TTAAGGTTTA C AC C TAT AAA AG AGAGAGC C 
3 61 GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG 
421 TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA CTTTACCCGG 
481 TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC AGTGTGCCGG 
541 TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC ATCAAAAACG 
601 CCATTAACCT GATGTTCTGG GGAATATAGA ATTCGCGGCC GCACTCGAGA TATCTAGACC 
661 CAGCTTTCTT GTACAAAGTT GGCATTATAA GAAAGCATTG CTTATCAATT TGTTGCAACG 
721 AACAGGTCAC TATCAGTCAA AATAAAATCA TTATTTGCCA TCCAGCTGCA GCTCTGGCCC 
781 GTGTCTCAAA ATCTCTGATG TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA 
841 AAACTGTCTG CTTACATAAA CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA 
901 ACGTCGAGGC CGCGATTAAA TTCCAACATG GATGCTGATT TATATGGGTA TAAATGGGCT 
961 CGCGATAATG TCGGGCAATC AGGTGCGACA ATCTATCGCT TGTATGGGAA GCCCGATGCG 
1021 C C AG AGTTGT TTCTGAAACA TGGCAAAGGT AGCGTTGCCA ATGATGTTAC AGATGAGATG 
1081 GTCAGACTAA ACTGGCTGAC GGAATTTATG CCTCTTCCGA C CATC AAG C A TTTTATCCGT 
1141 ACTCCTGATG ATGCATGGTT ACTCACCACT GCGATCCCCG GAAAAACAGC ATTCCAGGTA 
12 01 TTAGAAGAAT ATCCTGATTC AGGTGAAAAT ATTGTTGATG CGCTGGCAGT GTTCCTGCGC 

12 61 CGGTTGCATT CGATTCCTGT TTGTAATTGT CCTTTTAACA GCGATCGCGT ATTTCGTCTC 

13 21 GCTCAGGCGC AATCACGAAT GAATAACGGT TTGGTTGATG CGAGTGATTT TGATGACGAG 
13 81 CGTAATGGCT GGCCTGTTGA ACAAGTCTGG AAAGAAATGC ATAAACTTTT GCCATTCTCA 
1441 CCGGATTCAG TCGTCACTCA TGGTGATTTC T C AC TTG AT A ACCTTATTTT TGACGAGGGG 

15 01 AAATTAATAG GTTGTATTGA TGTTGGACGA GTCGGAATCG CAGACCGATA CCAGGATCTT 
1561 GCCATCCTAT GGAACTGCCT CGGTGAGTTT TCTCCTTCAT TACAGAAACG GCTTTTTCAA 
1621 AAATATGGTA TTGATAATCC TGATATGAAT AAATTGCAGT TTCATTTGAT GCTCGATGAG 

16 81 TTTTTCTAAT CAGAATTGGT TAATTGGTTG TAACATTATT CAGATTGGGC CCCGTTCCAC 
1741 TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC 
18 01 GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT 
18 61 CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT 
1921 ACTGTTCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT 
1981 ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT 
2 041 CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG 

, 2101 GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA 
2161 CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG 
2221 GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG 
22 81 TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC 
2 34V TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG 
2 4 01 GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT 
2 461 AACCGTATTA CCGCTAGCAT GGATCTCGGG GACGTCTAAC TACTAAGCGA GAGTAGGGAA 
2 521 CTGCCAGGCA TCGAATAAAA CGAAAGGCTC AGTCGGAAGA CTGGGCCTTT CGTTTTATCT 
2 581 GTTGTTTGTC GGTGAACGCT CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG 
2 641 TTGTGAAGCA ACGGCCCGGA GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC 
2 7 01 AAAC TAAGC A GAAGGCCATC 





Figure / Vk Cloning sites of the bA f^f Vector 
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pENTR6 2717 bp 



Location (Base Nos . ) Gene Encoded 

67 . . 166 attLl 

321 . . 626 ccdB 

655 . . 754 attL2 

877 . . 1686 KmR 

1791 . .2364 ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 

61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 

121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTGCAT GCGAACCAAT 

181 TCAGTCGACT GGATCCGGTA CCGAATTCGC TTACTAAAAG CCAGATAACA GTATGCGTAT 

241 TTGCGCGCTG ATTTTTGCGG TATAAGAATA TATACTGATA TGTATACCCG AAGTATGTCA 

3 01 AAAAGAGGTG TGCTTCTAGA ATGCAGTTTA AGGTTTACAC C T AT AAAAG A GAGAGCCGTT 

361 ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC GCCCGGGCGA CGGATGGTGA 

421 TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC CCGTGAACTT TACCCGGTGG 

481 TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA TATGGCCAGT GTGCCGGTCT 

541 CCGTTATCGG GGAAGAAGTG GCTGATCTCA GC C AC CG C GA AAATGAC AT C AAAAACGCCA 

6 01 TTAACCTGAT GTTCTGGGGA ATATAGAATT CGCGGCCGCA CTCGAGATAT CTAGACC CAG 

661 CTTTCTTGTA CAAAGTTGGC ATTATAAGAA AGCATTGCTT ATCAATTTGT TGCAACGAAC 

721 AGGTCACTAT CAGTCAAAAT AAAATCATTA TTTGCCATCC AGCTGCAGCT CTGGCCCGTG 

781 TCTCAAAATC TCTGATGTTA CATTGCACAA GATAAAAATA TAT CATCATG AACAATAAAA 

841 CTGTCTGCTT ACATAAACAG TAATACAAGG GGTGTTATGA GCCATATTCA ACGGGAAACG 

901 TCGAGGCCGC GATTAAATTC CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC 

961 GATAATGTCG GGCAATCAGG TGCGACAATC TATCGCTTGT ATGGGAAGCC CGATGCGCCA 

1021 GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA TGAGATGGTC 

10 81 AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA TCAAGCATTT TATCCGTACT 

1141 CCTGATGATG CATGGTTACT CACCACTGCG ATCCCCGGAA AAACAGCATT CCAGGTATTA 

12 01 GAAGAATATC CTGATTCAGG TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG 

12 61 TTGCATTCGA TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 

13 21 CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA TGACGAGCGT 
13 81 AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA AACTTTTGCC ATTCTCACCG 
1441 GATTCAGTCG TCACTCATGG TGATTTCTCA CTTGATAACC TTATTTTTGA CGAGGGGAAA 

15 01 TTAATAGGTT GTATTGATGT TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC 
1561 ATCCTATGGA ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 
1621 TATGGTATTG ATAATCCTGA TAT G AAT AAA TTGCAGTTTC ATTTGATGCT CGATGAGTTT 

16 81 T T C T AAT CAG AATTGGTTAA TTGGTTGTAA CATTATTCAG ATTGGGCCCC GTTCCACTGA 
1741 GCGTCAGACC CCGTAGAAAA GAT C AAAGG A TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA 
18 01 ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA 
1861 GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAG CGCAGAT ACCAAATACT 
1921 GTTCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA 
1981 TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT 
2 041 ACCGGGTTGG ACTCAAGACG. ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG 

/ ,2101 GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG 

'2161 CGTGAGCTAT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA 

2 221 AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT 

2281 CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG 

2 34*1- TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC 

24 01 TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC 

2461 CGTATTACCG CTAGCATGGA TCTCGGGGAC GTCTAACTAC TAAG CGAGAG TAGGGAACTG 

2 521 CCAGGCATCA AATAAAACGA AAGGCTCAGT CGGAAGACTG GGCCTTTCGT TTTATCTGTT 

2 581 GTTTGTCGGT GAACGCTCTC CTGAGTAGGA CAAATCCGCC GGGAGCGGAT TTGAACGTTG 

2 641 TGAAGCAACG GCCCGGAGGG TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA 

2 701 CTAAGCAGAA GGCCATC 



Figure/^ Cloning sites of the £>i1^y Vector 



attll 

tt jg tac aaa aaa gca ggc ttt gaa aac ctg tat ttt caa gga 

ttt cgt ccg aaa ctt ttg gac ata aaa gtt cct 



aac atg ttt 



Leu Tyr Lys 



XmnI 



Lys Ala Gly Phe Glu Asn Leu Tyr Phe GlnjGly 

TEV Protease 



Sail 



Bam 



Kpnl EcoRI 



acc 
tgg 
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caa agt acg tag cag ctjg acc tag| gqc atg get taa| gcg 



|gc|c atg" get taa| 
Thr Val Ser Cys He Val Asp Trp He Arg Tyr Arg He 
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Not I 



Xhol EcoRV Xbal 



Death 
(ccdB) 



4 



ada att 



tct taa 



cgc 
gcg 



|ggc cgc ac[t 



ccg gbg tga get 



cga gat 
eta 



Lt|c 



tag 



tag ate 



acc cag 
tgg gtc 



Int 



ctt 12 



ctt tct tgt aca aag 



gaa aga aca tct ttc — 



pENTR7 2738 bp 



Location (Base Nos.) Rene Encoded 



67 . . 166 


attLl 


342 . .647 


ccdB 


676 . . 775 


attL2 


898 . . 1707 


KmR 


1812 . .2385 


ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT 
181 TTTCAAGGAA CCGTTTCATG CATCGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA 
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT 
3 01 ATGT AT AC C C GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA 
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
421 CGCCCGGGCG ACGGATAGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
481 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
6 01 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC 
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT 
721 TATCAATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC 
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT 
841 ATATCATCAT GAACAATAAA ACTGT CTGCT TACATAAACA GTAATACAAG GGGTGTTATG 
901 AG C CAT AT T C AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA 
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG 
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT 
10 81 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC 
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA 

12 01 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG 
1261 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC 

13 21 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG 
13 81 AGTGATTTTG ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT 
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC 
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA 
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA 
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAAT C CTG ATATGAATAA ATTGCAGTTT 
1681 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATT GGTT A ATTGGTTGTA ACATTATTCA 
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 
18 01 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG 
18 61 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 
1921 AG AG CG C AG A T AC C AAAT AC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
2 041 AGTGGCGATA AGTCGTGTCT T AC CGGGTTG GACT CAAGAC GATAGTT AC C GGATAAGGCG 

,2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 
2221 AAGG CGGAC A GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 
22 81 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
2 341 .-CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
24 01 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 
2461 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA 
2 521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC AAATAAAACG AAAGGCTCAG TCGGAAGACT 
2 5 81 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC 
2 641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC 
27 01 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC 




Figure p/>: Cloning Sites of the 6<\tQi Vector ff^fi 




gu uc ctg tat ttt caa gga 
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gaa aga aca tg^t ttc^y* 



pENTR8 

Location (Base Nos . ) 



67 . . 


166 


339. . 


644 


673 . . 


772 


895 . . 


1704 


1809 . 


.2382 




2735 bp 

Rene Encoded 

attLl 

ccdB 

attL2 

KmR 

ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT 
181 TTTCAAGGAA CCATGGACCT AGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC 
241 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
3 01 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT 
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC 
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC 
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA 
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA 
6 01 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT 
661 CGAGATATCT AGACCCAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT 
721 CAATTTGTTG CAACGAACAG GTCACTATCA GTCAAAATAA AATCATTATT TGCCATCCAG 
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA 
841 TCATCATGAA CAATAAAACT GTCTGCTTAC ATAAACAGTA ATACAAGGGG TGTTATGAGC 
901 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT 
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT 
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT 
1081 GTTACAGATG AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC 
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA 
12 01 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG 
1261 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT 
1321 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT 
1381 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA 
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT 
1501 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC 
1561 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG 
1621 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT 
16 81 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT 
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 
1801 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG 
1861 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 
2 041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 
2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 
'2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 
22 81 GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 
2 341- CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 
2 4 01 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC 
2461 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAAC TACT A 
2 521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG 
2 5 81 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG 
2641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT 
2 7 01 AAAGTGCCAG GCATCAAACT AAGCAGAAGG CCATC 




Figure Cloning sites of the bO+^l Vector f> &Jm 
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pENTR9 2735 bp 

Location (Base Nos . ) Gene Encoded 



67 . . 166 


attLl 


339 . . 644 


ccdB 


673 . . 772 


attL2 


895 . . 1704 


KmR 


1809 . .2382 


ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTTGA AAACCTGTAT 
181 TTTCAAGGAC ATATGAGATC TGTCGACTGG ATCCGGTACC GAATTCGCTT ACTAAAAGCC 
241 AG AT AAC AG T ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
3 01 TAT AC C CG AA GT ATGT C AAA AAGAGGTGTG CTTCTAGAAT GCAGTTTAAG GTTTACACCT 
361 ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC 
421 CCGGGCGACG GATAGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC 
481 GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG GCGCATGATG ACCACCGATA 
541 TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA 
6 01 ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT ATAGAATTCG CGGCCGCACT 
661 CGAGATATCT AG AC C CAGCT TTCTTGTACA AAGTTGGCAT TATAAGAAAG CATTGCTTAT 
721 CAATTTGTTG CAACGAACAG GTCACTATCA GT C AAAAT AA AATCATTATT TGCCATCCAG 
781 CTGCAGCTCT GGCCCGTGTC TCAAAATCTC TGATGTTACA TTGCACAAGA TAAAAATATA 
841 TCATCATGAA CAATAAAACT GTCTGCTTAC AT AAAC AG T A ATACAAGGGG TGTTATGAGC 
9 01 CATATTCAAC GGGAAACGTC GAGGCCGCGA TTAAATTCCA ACATGGATGC TGATTTATAT 
961 GGGTATAAAT GGGCTCGCGA TAATGTCGGG CAATCAGGTG CGACAATCTA TCGCTTGTAT 
1021 GGGAAGCCCG ATGCGCCAGA GTTGTTTCTG AAACATGGCA AAGGTAGCGT TGCCAATGAT 
10 81 GTTACAGATG AGATGGTCAG ACTAAAC TGG CTGACGGAAT TTATGCCTCT TCCGACCATC 
1141 AAGCATTTTA TCCGTACTCC TGATGATGCA TGGTTACTCA CCACTGCGAT CCCCGGAAAA 
12 01 ACAGCATTCC AGGTATTAGA AGAATATCCT GATTCAGGTG AAAATATTGT TGATGCGCTG 

12 61 GCAGTGTCCC TGCGCCGGTT GCATTCGATT CCTGTTTGTA ATTGTCCTTT TAACAGCGAT 

13 21 CGCGTATTTC GTCTCGCTCA GGCGCAATCA CGAATGAATA ACGGTTTGGT TGATGCGAGT 
13 81 GATTTTGATG ACGAGCGTAA TGGCTGGCCT GTTGAACAAG TCTGGAAAGA AATGCATAAA 
1441 CTTTTGCCAT TCTCACCGGA TTCAGTCGTC ACTCATGGTG ATTTCTCACT TGATAACCTT 
15 01 ATTTTTGACG AGGGGAAATT AATAGGTTGT ATTGATGTTG GACGAGTCGG AATCGCAGAC 

15 61 CGATACCAGG ATCTTGCCAT CCTATGGAAC TGCCTCGGTG AGTTTTCTCC TTCATTACAG 

16 21 AAACGGCTTT TTCAAAAATA TGGTATTGAT AATCCTGATA TGAATAAATT GCAGTTTCAT 
16 81 TTGATGCTCG ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA TTATTCAGAT 
1741 TGGGCCCCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT 
18 01 CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT AC CAGCGGTG 
18 61 GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA 
1921 GCGCAGATAC CAAATACTGT TCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 
1981 TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT 
2 041 GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG 

/ ,2101 CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC 
2161 GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG 
2221 GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTC C A 
22 81. GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT 
2 34*1" CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC 
24 01 TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC 
24 61 CCTGATTCTG TGGATAACCG TATTACCGCT AGCATGGATC TCGGGGACGT CTAACTACTA 
2 521 AGCGAGAGTA GGGAACTGCC AGGCATCAAA TAAAACGAAA GGCTCAGTCG GAAGACTGGG 
2 581 CCTTTCGTTT TATCTGTTGT TTGTCGGTGA ACGCTCTCCT GAGTAGGACA AATCCGCCGG 
2 641 GAGCGGATTT GAACGTTGTG AAGCAACGGC CCGGAGGGTG GCGGGCAGGA CGCCCGCCAT 
2 7 01 AAACTGCCAG G C AT C AAAC T AAGCAGAAGG CCATC 



Figure / 1M Cloning sites of the c*t^ Vector p&M^' 0 
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pENTRlO 2738 bp 



Location (Base Nos . ) 
67 . . 166 
342 . .647 
676 . . 775 
898 . .1707 
1812. .2385 



Gene Encoded 

attLl 

ccdB 

attL2 

KmR 

ori 



1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA ACTAAGGAAA 
181 TACTTACATA TGGGAACCAA TTCAGTCGAC TGGATCCGGT ACCGAATTCG CTTACTAAAA 
241 GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT 
3 01 ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTTCTAG AATGCAGTTT AAGGTTTACA 
361 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
421 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
481 CCCGTGAACT TTACCCGGTG GTGQATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 
541 ATATGGCCAG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
6 01 AAAATGACAT CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAGAAT TCGCGGCCGC 
661 ACTCGAGATA TCTAGACCCA GCTTTCTTGT ACAAAGTTGG CATTATAAGA AAGCATTGCT 
721 TAT C AATTTG TTGCAACGAA CAGGTCACTA TCAGTCAAAA TAAAATCATT ATTTGCCATC 
781 CAGCTGCAGC TCTGGCCCGT GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT 
841 ATATCATCAT GAACAATAAA ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG 
901 AGCCATATTC AACGGGAAAC GTCGAGGCCG CGATTAAATT CCAACATGGA TGCTGATTTA 
961 TATGGGTATA AATGGGCTCG CGATAATGTC GGGCAATCAG GTGCGACAAT CTATCGCTTG 
1021 TATGGGAAGC CCGATGCGCC AGAGTTGTTT CTGAAACATG GCAAAGGTAG CGTTGCCAAT 
1081 GATGTTACAG ATGAGATGGT CAGACTAAAC TGGCTGACGG AATTTATGCC TCTTCCGACC 
1141 ATCAAGCATT TTATCCGTAC TCCTGATGAT GCATGGTTAC TCACCACTGC GATCCCCGGA 
12 01 AAAACAGCAT TCCAGGTATT AGAAGAATAT CCTGATTCAG GTGAAAATAT TGTTGATGCG 

12 61 CTGGCAGTGT TCCTGCGCCG GTTGCATTCG ATTCCTGTTT GTAATTGTCC TTTTAACAGC 
1321 GATCGCGTAT TTCGTCTCGC TCAGGCGCAA TCACGAATGA ATAACGGTTT GGTTGATGCG 

13 81 AGTGATTTTG ATGACGAGCG TAATGGCTGG CCTGTTGAAC AAGTCTGGAA AGAAATGCAT 
1441 AAACTTTTGC CATTCTCACC GGATTCAGTC GTCACTCATG GTGATTTCTC ACTTGATAAC 
1501 CTTATTTTTG ACGAGGGGAA ATTAATAGGT TGTATTGATG TTGGACGAGT CGGAATCGCA 
1561 GACCGATACC AGGATCTTGC CATCCTATGG AACTGCCTCG GTGAGTTTTC TCCTTCATTA 
1621 CAGAAACGGC TTTTTCAAAA ATATGGTATT GATAATCCTG ATATGAATAA ATTGCAGTTT 
16 81 CATTTGATGC TCGATGAGTT TTTCTAATCA GAATTGGTTA ATTGGTTGTA ACATTATTCA 
1741 GATTGGGCCC CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 
18 01 GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAAC C AC C GCTACCAGCG 
18 61 GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 
1921 AGAGCGCAGA T AC C AAAT AC TGTTCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
1981 AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 
2 041 AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 
.2101 CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 
2161 ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 
2221 AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 
22 81 CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
2 3 4V CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 
24 01 GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 
24 61 TCCCCTGATT CTGTGGATAA CCGTATTACC GCTAGCATGG ATCTCGGGGA CGTCTAACTA 
2 521 CTAAGCGAGA GTAGGGAACT GCCAGGCATC GAATAAAACG AAAGGCTCAG TCGGAAGACT 
2 581 GGGCCTTTCG TTTTATCTGT TGTTTGTCGG TGAACGCTCT CCTGAGTAGG ACAAATCCGC 
2 641 CGGGAGCGGA TTTGAACGTT GTGAAGCAAC GGCCCGGAGG GTGGCGGGCA GGACGCCCGC 
2 701 CATAAACTGC CAGGCATCAA ACTAAGCAGA AGGCCATC 
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pENTRll 2744 bp (rotated to position 2578) 

Location (Rase Nos.) Gene Encoded 

67.. 166 attLl 

348 . . 653 ccdB 

683 . . 781 attL2 

904 . . 1713 KmR 

1818 . .2391 °ri 

1 CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTCCTG TTAGTTAGTT ACTTAAGCTC 
61 GGGCCCCAAA TAATGATTTT ATTTTGACTG ATAGTGACCT GTTCGTTGCA ACAAATTGAT 
121 AAGCAATGCT TTTTTATAAT GCCAACTTTG TACAAAAAAG CAGGCTTCGA AGGAGATAGA 
181 ACCAATTCTC TAAGGAAATA CTTAACCATG GTCGACTGGA TCCGGTACCG AATTCGCTTA 
241 CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT 
3 01 ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTGC TTCTAGAATG CAGTTTAAGG 
361 TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA 
421 TTGACACGCC CGGGCGACGG ATAGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA 
481 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA 
541 CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC 
6 01 ACCGCGAAAA TGAC AT C AAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAGAATTCGC 
661 GGCCGCACTC GAGATATCTA GACCCAGCTT TCTTGTACAA AGTTGGCATT ATAAGAAAGC 
721 ATTGCTTATC AATTTGTTGC AACGAACAGG TCACTATCAG TCAAAATAAA ATCATTATTT 
781 GCCATCCAGC TGCAGCTCTG GCCCGTGTCT CAAAATCTCT GATGTTACAT TGCACAAGAT 
841 AAAAATATAT CATCATGAAC AATAAAACTG TCTGCTTACA TAAACAGTAA TACAAGGGGT 
901 GTTATGAGCC ATATTCAACG GGAAACGTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT 
961 GATTTATATG GGTATAAATG GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT 
1021 CGCTTGTATG GGAAGCCCGA TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTAGCGTT 
1081 GCCAATGATG TTACAGATGA GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT 
1141 CCGACCATCA AGCATTTTAT CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC 
12 01 CCCGGAAAAA CAGCATTCCA GGTATTAGAA GAATATCCTG ATTCAGGTGA AAATATTGTT 

12 61 GATGCGCTGG CAGTGTTCCT GCGCCGGTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT 
1321 AACAGCGATC GCGTATTTCG TCTCGCTCAG GCGCAATCAC GAATGAATAA CGGTTTGGTT 

13 81 GATGCGAGTG ATTTTGATGA CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGAA 
1441 ATGCATAAAC TTTTGCCATT CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT 

15 01 GATAACCTTA TTTTTGACGA GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA 
1561 ATCGCAGACC GATACCAGGA TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT 

16 21 TCATTACAGA AACGGCTTTT TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG 
16 81 CAGTTTCATT TGATGCTCGA TGAGTTTTTC TAATCAGAAT TGGTTAATTG GTTGTAACAT 
1741 T ATT C AG ATT GGGCCCCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 
18 01 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA 
1861 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
1921 TTCAGCAGAG CGCAGATACC AAATACTGTT CTTCTAGTGT AGCCGTAGTT AGGCCACCAC 
1981 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT 
2 041 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 
2101 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAGCGAACG 
2161 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA 
2221 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG 
2 2 81 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA 
034V CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 
24 01 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT 
2461 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCTA GCATGGATCT CGGGGACGTC 
2 521 TAACTACTAA GCGAGAGTAG GGAACTGCCA GGCATCAAAT AAAACGAAAG GCTCAGTCGG 
2 581 AAGACTGGGC CTTTCGTTTT ATCTGTTGTT TGTCGGTGAA CGCTCTCCTG AGTAGGACAA 
2 641 ATCCGCCGGG AGCGGATTTG AACGTTGTGA AGCAACGGCC CGGAGGGTGG CGGGCAGGAC 
2 7 01 GCCCGCCATA AACTGCCAGG CATCAAACTA AGCAGAAGGC CATC 



Figure ^/A:pE>tsr^_ Native Protein Expression in E, coli 
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pDESTl 6464 bp 



Location (Base Nos . ) 
216 . .257 
397 . . 
647 . 
1426 
1648 
1994 
2598 



273 
1306 
. . 1510 
. . 1953 
. .2118 
. .3503 
4104 . .4264 
4504 . .4941 
5340 . . 6420 



Gene Encoded 
Trc promoter 
attRl 
CmR 

inactivated ccdA 
ccdB 
att'R2 
ampR 
ori 
f lori 
laclq 



(fl intergenic region) 



1 GTTTG AC AG C TTATCATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC 
61 GGAAGCTGTG GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC 
121 GCACTCCCGT TCTGGATAAT GTTTTTTGCG CCGACATCAT AACGGTTCTG GCAAATATTC 
181 TGAAATGAGC TGTTGACAAT TAATCATCCG GTCCGTATAA TCTGTGGAAT TGTGAGCGGG 
241 ATAACAATTT CATCGCGAGG TACCAAGCTA T C AC AAGTTT GTACAAAAAA GCTGAACGAG 
3 01 AAACGTAAAA TGATATAAAT AT C AATAT AT TAAATTAGAT TTTGCATAAA AAACAGACTA 
361 CATAATACTG TAAAACACAA CATATCCAGT CACTATGGCG GCCGCTAAGT TGGCAGCATC 
421 ACCCGACGCA CTTTGCGCCG AATAAATACC TGTGACGGAA GATCACTTCG CAGAATAAAT 
481 AAATCCTGGT GTCCCTGTTG ATACCGGGAA GCCCTGGGCC AACTTTTGGC GAAAATGAGA 
541 CGTTGATCGG CACGTAAGAG GTTCCAACTT TCACCATAAT GAAATAAGAT CACTACCGGG 
6 01 CGTATTTTTT GAGTTATCGA GATTTTCAGG AGCTAAGGAA GCTAAAATGG AGAAAAAAAT 
661 CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT AAAGAACATT TTGAGGCATT 
721 TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG CTGGATATTA CGGCCTTTTT 
781 AAAGACCGTA AAGAAAAATA AGCACAAGTT TTATCCGGCC TTTATTCACA TTCTTGCCCG 
841 CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA GACGGTGAGC TGGTGATATG 
9 01 GGATAGTGTT CACCCTTGTT ACACCGTTTT CCATGAGCAA ACTGAAACGT TTTCATCGCT 
961 CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC ATATATTCGC AAGATGTGGC 
1021 GTGTTACGGT GAAAACCTGG CCTATTTCCC TAAAGGGTTT ATTGAGAATA TGTTTTTCGT 
1081 CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA AACGTGGCCA ATATGGACAA 
1141 CTTCTTCGCC CCCGTTTTCA CCATGGGCAA AT ATT AT AC G CAAGGCGACA AGGTGCTGAT 
12 01 GCCGCTGGCG ATTCAGGTTC ATCATGCCGT CTGTGATGGC TTCCATGTCG GCAGAATGCT 
12 61 TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG GCGTAAACGC GTGGATCCGG 
1321 CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT GATTTTTGCG GTATAAGAAT 
1381 ATATACTGAT ATGTATACCC GAAGTATGTC AAAAAGAGGT GTGCTATGAA GCAGCGTATT 
14 41 ACAGTGACAG TTGACAGCGA C AG CTAT C AG TTGCTCAAGG CATATATGAT GTCAATATCT 
1501 CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA 
1561 AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT 
1621 TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG 
16 81 AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG 
1741 ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC 
18 01 CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG 
V8 61 CCGGTCTCCG TTATCGGGGA AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA 
1921 AACGC CATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG GCTCCCTTAT ACAC AG C C AG 
1981 TCTGCAGGTC G AC C ATAGTG ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT 
2 041^ TTTATGCAAA ATCTAATTTA ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC 
2101 TTTCTTGTAC AAAGTGGTGA TAGCTTGGCT GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC 
2161 TGATACAGAT TAAATCAGAA CGCAGAAGCG GTCTGATAAA ACAGAATTTG CCTGGCGGCA 
2221 GTAGCGCGGT GGTCCCACCT GACCCCATGC CGAACTCAGA AGTGAAACGC CGTAGCGCCG 
22 81 ATGGTAGTGT GGGGTCTCCC CATGCGAGAG TAGGGAACTG CCAGGCATCA AATAAAACGA 
2341 AAGGCTCAGT CGAAAGACTG GGCCTTTCGT TTTATCTGTT GTTTGTCGGT GAACGCTCTC 
24 01 C TGAGT AGG A CAAATCCGCC GGGAGCGGAT TTGAACGTTG CGAAGCAACG GCCCGGAGGG 
24 61 TGGCGGGCAG GACGCCCGCC ATAAACTGCC AGGCATCAAA TTAAGCAGAA GGCCATCCTG 
2 521 ACGGATGGCC TTTTTGCGTT TCTACAAACT CTTTTTGTTT ATTTTTCTAA ATACATTCAA- 




2 581 ATATGTATCC GCTCATGAGA CAATAACCCT 
2 641 AGAGTATGAG TATTCAACAT TTCCGTGTCG 
27 01 TTCCTGTTTT TGCTCACCCA GAAACGCTGG 
2761 GTGCACGAGT GGGTTACATC GAACTGGATC 
2821 GCCCCGAAGA ACGTTTTCCA ATGATGAGCA 
2881 TATCCCGTGT TGACGCCGGG CAAGAGCAAC 

2 941 ACTTGGTTGA GTACTCACCA GTCACAGAAA 
3001 AATTATGCAG TGCTGCCATA AC C ATGAGTG 

3 061 CGATCGGAGG ACCGAAGGAG CTAACCGCTT 
3121 GCCTTGATCG TTGGGAACCG GAG CTGAATG 
3181 CGATGCCTAC AG CAATGGC A ACAACGTTGC 
3 241 TAGCTTCCCG GCAACAATTA ATAGACTGGA 

33 01 TGCGCTCGGC CCTTCCGGCT GGCTGGTTTA 
3361 GGTCTCGCGG TATCATTGCA GCACTGGGGC 
3421 TCTACACGAC GGGGAGTCAG GCAACTATGG 

34 81 GTGCCTCACT GATTAAGCAT TGGTAACTGT 
3 541 TTGATTTAAA ACTTCATTTT TAATTTAAAA 
3 6 01 TCATGACCAA AATCCCTTAA C GTG AGTTTT 
3661 AGATCAAAGG ATCTTCTTGA GATCCTTTTT 
3721 AAAAAC C AC C GCTACCAGCG GTGGTTTGTT 
3781 CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA 
3 841 AGTTAGGCCA CCACTTCAAG AACTCTGTAG 
3 901 TGTTACCAGT GGCTGCTGCC AGTGGCGATA 

3 961 GATAGTTACC GGATAAGGCG CAGCGGTCGG 

4 021 GCTTGGAGCG AACGACCTAC ACCGAACTGA 
4 081 CCACGCTTCC CGAAGGGAGA AAGGCGGACA 
4141 GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA 
42 01 TTCGCCACCT CTGACTTGAG CGTCGATTTT 
42 61 GGAAAAACGC CAGCAACGCG GCCTTTTTAC 
4 321 ACATGTTCTT TCCTGCGTTA TCCCCTGATT 
4381 GAG C TG AT AC CGCTCGCCGC AGCCGAACGA 
4441 CGGAAGAGCG CCTGATGCGG TATTTTCTCC 
4 501 TAATTTTGTT AAAATTCGCG TTAAATTTTT 
4 561 CCGAAATCGG CAAAATCCCT TATAAATCAA 
4 621 TTCCAGTTTG GAACAAGAGT CCACTATTAA 
4 681 AAAC CGTCTA TCAGGGCGAT GGCCCACTAC 
4 741 GGTCGAGGTG CCGTAAAGCA CTAAATCGGA 
4 8 01 GACGGGGAAA GCCGGCGAAC GTGGCGAGAA 
48 61 CTAGGGCGCT GGCAAGTGTA GCGGTCACGC 
4 921 ATGCGCCGCT ACAGGGCGCG TCCATTCGCC 

4 981 AATCTGCTCT GATGCCGCAT AGTTAAGCCA 

5 041 TACACTCCGC TATCGCTACG TGACTGGGTC 
5101 GCTGACGCGC CCTGACGGGC TTGTCTGCTC 
5161 GTCTCCGGGA GCTGCATGTG TCAGAGGTTT 
52 21 CAGATCAATT CGCGCGCGAA GGCGAAGCGG 

'5281 GCAAAACCTT TCGCGGTATG GCATGATAGC 
5341 ATGTGAAACC AGTAACGTTA TACGATGTCG 
54 01 TTTCCCGCGT GGTGAAC C AG GCCAGCCACG 

54 61-=: CGGCGATGGC GGAGCTGAAT TACATTCCCA 
5521 AGTCGTTGCT GATTGGCGTT GCCACCTCCA 

55 81 TCGCGGCGAT TAAATCTCGC GCCGATCAAC 
5641 AACGAAGCGG CGTCGAAGCC TGTAAAGCGG 
57 01 GTGGGCTGAT CATTAACTAT CCGCTGGATG 

57 61 GCACTAATGT TCCGGCGTTA TTTCTTGATG 
5821 TTTTCTCCCA TGAAGACGGT ACGCGACTGG 

58 81 AGCAAATCGC GCTGTTAGCG GGCCCATTAA 

5 94 1 GCTGGCATAA ATATCTCACT CGCAATCAAA 

6 001 GGAGTGCCAT GTC CGGTTTT CAACAAACCA 




GATAAATGCT TCAATAATAT TGAAAAAGGA 
CCCTTATTCC CTTTTTTGCG GCATTTTGCC 
TGAAAGTAAA AG ATG C TG AA GAT C AGTTGG 
TCAACAGCGG TAAGATCCTT GAGAGTTTTC 
CTTTTAAAGT TCTGCTATGT GGCGCGGTAT 
TCGGTCGCCG CATACACTAT TCTCAGAATG 
AGCATCTTAC GGATGGCATG ACAGTAAGAG 
ATAACACTGC GGCCAACTTA CTTCTGACAA 
TTTTGCACAA CATGGGGGAT CATGTAACTC 
AAG C CAT AC C AAACGACGAG CGTGACACCA 
GCAAACTATT AACTGGCGAA CTACTTACTC 
TGGAGGCGGA T AAAGTTG C A GGACCACTTC 
TTGCTGATAA ATCTGGAGCC GGTGAGCGTG 
CAGATGGTAA GCCCTCCCGT ATCGTAGTTA 
ATGAACGAAA TAG AC AG AT C GCTGAGATAG 
CAGACCAAGT TTAC T CAT AT ATACTTTAGA 
GGATCTAGGT GAAGATCCTT TTTGATAATC 
CGTTCCACTG AGCGTCAGAC CCCGTAGAAA 
TTCTGCGCGT AATCTGCTGC TTGCAAACAA 
TGCCGGATCA AGAGCTACCA ACTCTTTTTC 
T AC C AAAT AC TGTCCTTCTA GTGTAGCCGT 
CACCGCCTAC ATACCTCGCT CTGCTAATCC 
AGTCGTGTCT TACCGGGTTG GACTCAAGAC 
GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
GATACCTACA GCGTGAGCTA TGAGAAAGCG 
GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
ACGCCTGGTA TCTTTATAGT CCTGTCGGGT 
TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 
GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
CTGTGGATAA CCGTATTACC GCCTTTGAGT 
CCGAGCGCAG CGAGTCAGTG AG CGAG GAAG 
TTACGCATCT GTGCGGTATT TCACACCGCA 
GTTAAATCAG CTCATTTTTT AACCAATAGG 
AAGAATAGAC CGAGATAGGG TTGAGTGTTG 
AGAACGTGGA CTCCAACGTC AAAGGGCGAA 
GTGAACCATC ACCCTAATCA AGTTTTTTGG 
ACCCTAAAGG GAGCCCCCGA TTTAGAGCTT 
AGGAAGGGAA GAAAGCGAAA GGAGCGGGCG 
TGCGCGTAAC CACCACACCC GCCGCGCTTA 
ATTCAGGCTG CTATGGTGCA CTCTCAGTAC 
GTACCAGTCA CGTAGCGATA TCGGAGTGTA 
ATGGCTGCGC CCCGACACCC GCCAACACCC 
CCGGCATCCG CTTACAGACA AGCTGTGACC 
TCACCGTCAT CACCGAAACG CGCGAGGCAG 
CATGCATTTA CGTTGACACC ATCGAATGGT 
GCCCGGAAGA GAGTCAATTC AGGGTGGTGA 
CAGAGTATGC CGGTGTCTCT TATCAGACCG 
TTTCTGCGAA AACGCGGGAA AAAGTGGAAG 
ACCGCGTGGC ACAACAACTG GCGGGCAAAC 
GTCTGGCCCT GCACGCGCCG TCGCAAATTG 
TGGGTGCCAG CGTGGTGGTG TCGATGGTAG 
CGGTGCACAA TCTTCTCGCG CAACGCGTCA 
ACCAGGATGC CATTGCTGTG GAAGCTGCCT 
TCTCTGAC C A GACACCCATC AACAGTATTA 
GCGTGGAGCA TCTGGTCGCA TTGGGTCACC 
GTTCTGTCTC GGCGCGTCTG CGTCTGGCTG 
TTCAGCCGAT AGCGGAACGG GAAGGCGACT 
TGCAAATGCT GAATGAGGGC ATCGTTCCCA- 



6 061 CTGCGATGCT GGTTGCCAAC GATCAGATGG 

6121 CCGGGCTGCG CGTTGGTGCG GATATCTCGG 

6181 CATGTTATAT CCCGCCGTTA ACCACCATCA 

6 241 GCGTGGACCG CTTGCTGCAA CTCTCTCAGG 

6 3 01 CCGTCTCACT GGTGAAAAGA AAAACCACCC 

6361 GCGCGTTGGC CGATTCATTA ATGCAGCTGG 

6421 AGTGAGCGCA ACGCAATTAA TGTGAGTTAG 



CGCTGGGCGC AATGCGCGCC ATTACCGAGT 
TAGTGGGATA C G AC GAT AC C GAAGACAGCT 
AACAGGATTT TCGCCTGCTG GGGCAAACCA 
GCCAGGCGGT GAAGGGCAAT CAGCTGTTGC 
TGGCACCCAA TACGCAAACC GCCTCTCCCC 
CACGACAGGT TTCCCGACTG GAAAGCGGGC 
CGCGAATTGA TCTG 



Figure p>)xST2- His6 fusions in E. coli 
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pDEST2 6553 bp 



Location (Base Nos . ) 
912 . . 962 
1223 . 
1473 . 
2252 . 
2474 . 
2820. 
3509 , 
5015 
5415 
6225 



1009 
2132 
2336 
2779 
,2944 
, 4414 
.5175 
.5852 
.752 



Gene Encoded 

Trc 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

ampR 

ori 

f lori (f 1 intergenic region) 
laclq 



1 GGCGGTGCAC AATCTTCTCG CGCAACGCGT CAGTGGGCTG ATCATTAACT ATCCGCTGGA 
61 TG AC C AGG AT GCCATTGCTG TGGAAGCTGC CTGCACTAAT GTTCCGGCGT TATTTCTTGA 
121 TGTCTCTGAC CAGACACCCA TCAACAGTAT TATTTTCTCC CATGAAGACG GTACGCGACT 
181 GGGGGTGGAG CATCTGGTCG CATTGGGTCA C C AG C AAAT C GCGCTGTTAG CGGGCCCATT 
241 AAGTTCTGTC TCGGCGCGTC TGCGTCTGGC TGGCTGGCAT AAATATCTCA CTCGCAATCA 
3 01 AATTCAGCCG ATAG CGGAAC GGGAAGGCGA CTGGAGTGCC ATGTC CGGTT TTCAACAAAC 
361 CATGCAAATG CTGAATGAGG GCATCGTTCC CACTGCGATG CTGGTTGCCA AC GAT C AG AT 
421 GGCGCTGGGC GCAATGCGCG CCATTACCGA GTCCGGGCTG CGCGTTGGTG CGGATATCTC 
481 GGTAGTGGGA TACGACGATA C CGAAGAC AG CTCATGTTAT ATCCCGCCGT CAACCACCAT 
541 CAAACAGGAT TTTCGCCTGC TGGGGCAAAC CAGCGTGGAC CGCTTGCTGC AACTCTCTCA 
6 01 GGGCCAGGCG GTGAAGGGCA ATCAGCTGTT GCCCGTCTCA CTGGTGAAAA G AAAAAC C AC 
661 CCTGGCACCC AAT AC G C AAA CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATGCAGCT 
721 GGCACGACAG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 
781 AGCGCGAATT GATCTGGTTT GACAGCTTAT CATCGACTGC ACGGTGCACC AATGCTTCTG 
841 GCGTCAGGCA GCCATCGGAA GCTGTGGTAT GGCTGTGCAG GTCGTAAATC AC TG CAT AAT 
9 01 TCGTGTCGCT CAAGGCGCAC TCCCGTTCTG GATAATGTTT TTTGCGCCGA CAT C AT AAC G 
961 GTTCTGGCAA AT ATT C TG AA ATGAGCTGTT GACAATTAAT CATCCGGTCC GTATAAT CTG 
1021 TGGAATTGTG AG C GGATAAC AATTTCACAC AGGAAACAGA CCATGTCGTA CTACCATCAC 
1081 CATCACCATC ACGGCATCAC AAGTTTGTAC AAAAAAGCTG AACGAGAAAC GTAAAATGAT 
1141 AT AAAT AT C A ATATATTAAA TTAGATTTTG CATAAAAAAC AGACTACATA ATACTGTAAA 
12 01 ACACAACATA TCCAGTCACT ATGGCGGCCG CTAAGTTGGC AGCATCACCC GACGCACTTT 
12 61 GCGCCGAATA AAT AC CTGTG ACGGAAGATC ACTTCGCAGA ATAAATAAAT CCTGGTGTCC 
1321 CTGTTGATAC CGGGAAGCCC TGGGCCAACT TTTGGCGAAA ATGAGACGTT GATCGGCACG 
1381 TAAGAGGTTC CAACTTTCAC CATAATGAAA TAAGATCACT ACCGGGCGTA TTTTTTGAGT 
14 41 TATCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA AAAAATCACT GGATATACCA 
1501 CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA GGCATTTCAG TCAGTTGCTC 
1561 AATGTAC CTA TAACCAGACC GTTCAGCTGG ATATTACGGC CTTTTTAAAG ACCGTAAAGA 
1621 AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT TGCCCGCCTG ATGAATGCTC 
16 81 ATCCGGAATT C CGTATGGC A ATGAAAGACG GTGAGCTGGT GATATGGGAT AGTGTTCACC 
1741 C TTGTT AC AC CGTTTTCCAT GAGCAAACTG AAACGTTTTC ATCGCTCTGG AGTGAATACC 
,18 01 AC G AC G ATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA TGTGGCGTGT TACGGTGAAA 
18 61 ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT TTTCGTCTCA GCCAATCCCT 
1921 GGGTGAGTTT C AC C AGTTTT GATTTAAACG TGGCCAATAT GGACAACTTC TTCGCCCCCG 
1981 TTTTCACCAT GGGCAAATAT TATACGCAAG GCGACAAGGT GCTGATGCCG CTGGCGATTC 
2 041- AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG AATG CTTAAT GAATTACAAC 
2101 AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG ATCCGGCTTA CTAAAAGCCA 
2161 GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT AAGAATATAT ACTGATATGT 
2221 ATACCCGAAG TATGTCAAAA AGAGGTGTGC TATGAAGCAG CGTATTACAG TGACAGTTGA 
2281 CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA ATATCTCCGG TCTGGTAAGC 
2341 ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC GCTGGAAAGC GGAAAATCAG 
24 01 GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG GCTCTTTTGC TGACGAGAAC 
24 61 AGGGACTGGT GAAATGCAGT TTAAGGTTTA C AC CTATAAA AG AG AG AG C C GTTATCGTCT 
2 521 GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG CGACGGATGG TGATCCCCCT- 





2581 GGCCAGTGCA CGTCTGCTGT CAG = CTC„ AGTGTGCCGG ^CCgSS 
2641 CGGGGATGAA AGCTGGCGCA ^ATGACCAC ATC AAAAACG CCATTAACCT 

2701 cggggaagaa gtggctgatc tcagccaccg cgaaaa AGCCAGTC tg caggtcgacc 

2761 gatgttctgg ggaatataaa tgtcaggctc cct CTGTTTTT ta tgcaaaatct 

2821 atagtgactg gatatgttgt ^tttacagt ^tatg ttcagctttc ttgtacaaag 

2881 aatttaatat attgatattt JTAtcatttt ^gt agctcactag tcgcggccgc 

2941 tggtgatgcc catatgggaa ttcaaaggcc ^acgt tgttttggcg gatgagagaa 

3001 ttctagagga tccctcgagg catgcggtac caagci gataa aacagaattt 

3061 GATTTTCAGC CTGATACAGA TTAAATCAGA JCGCAGAAGC ^ 

31 21 GCCTGGCGGC AGTAGCGCGG TGGTCCCACC TGACCCC^ G CCAGGCATC 

3181 CCGTAGCGCC GATGGTAGTG TGGGGTCTCC TTTTATCTGT TGTTTGTCGG 

3241 aaataaaacg aaaggctcag tcgaaagact gggcctttc tt gcgaagcaac 

3301 tgaacgctct cctgagtagg acaaatccgc cgggagcgga atcaa attaagcaga 

3 361 GGCCCGGAGG GTGGCGGGCA GGACGCCCGC CATAAAC^ tctttttgtt TATTTT TCTA 
3421 AGGCCATCCT GACGGATGGC CTTTTTGCGT TTCTACAAA c TTCAATAATA 

3481 AATACATTCA AATATGTATC CGCTCATGAG ^TAACCC ^ CCTTTTTTGC 

3541 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCG GTGAAAGTAA AAGATGCTGA 

3601 GGCATTTTGC CTTCCTGTTT TTGCTCACCC J^GC^ GTAAGATC ct 
3661 AGATCAGTTG GGTGCACGAG TGGGTTACAT ^ AC TTTTAAAG TTCTGCTATG 

3721 TGAGAGTTTT CGCCCCGAAG ^GTTTTCC CTCGGTC GCC GCATACACTA 

3781 TGGCGCGGTA TTATCCCGTG TTGACGCCGG GCAAGAGC ^gcaTCTTA CGGATGGCAT 
384X TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA ^ CGGCCAACTT 

3 9 01 GACAGTAAGA GAATTATGCA ^GCTGCCAT JACCATG TTTTTGCACA ACATGGGGGA 
3 961 ACTTCTGACA ACGATCGGAG GACCGAAGGA ^TAACL GAAGCCATAC CAAACGACGA 
4021 TCATGTAACT CGCCTTGATC ^TGGGAACC <™g£^ GGGAAA CTAT TAACTGGCGA 
4081 GCGTGACACC ACGATGCCTA CAGCAATGGC ^AAC A TGGAGGCGG ATAAAGTTGC 

4141 actacttact ctagcttccc ggcaacaatt ^agac tgata ^tcTGGAGC 

4201 AGGACCACTT CTGCGCTCGG CCCTTCCGGC ™f™ GG AGATGGTA AGCCCTCCCG 

4261 cggtgagcgt gggtctcgcg gtatcattgc agcactg gatgaacgaa atagacagat 
4 32i tatcgtagtt atctacacga cggggagtca ggcaa TCAGACC aag tttactcata 

4381 CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG ^ TGAAGATCCT 

4441 TATACTTTAG ATTGATTTAA AACTTCATTT JTAATT TCGTTCCACT GAGCGTCAGA 

4501 TTTTGATAAT CTCATGACCA AAATCCCTTA JCGTG TTTCTGCGCG TAATCTGCTG 

4561 CCCCGTAGAA AAGATCAAAG GATCTTCTTG ^ATCCTl c ^GCTACC 

4 4 5 62i cttgcaaaca aaaaaaccac cgctaccagc 3 a TGGttT a T ataccaaata c^tccttct 
4681 aactcttttt ccgaaggtaa ctggcttcag c * GC accgccta catacctcgc 

4741 agtgtagccg tagttaggcc ^cacttcaa ^gtcgtgtc ttaccgggtt 

4801 tctgctaatc ctgttaccag tggctgctgc cag GGCTGAACG g ggggttcgtg 

4861 ggactcaaga cgatagttac cggataaggc f™f agatacctac AGC gtgagct 

4921 CACACAGCCC AGCTTGGAGC OAAC^OTO A GGTATCCGG TAAGCGGCAG 

4981 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG ^ AAGGG CTGGT ATCTTTATAG 

5041 GGTCGGAACA GGAGAGCGCA ^GAGGGAGCT ™C TTGTGATGCT CGTCAGGGGG 

5101 TCCTGTCGGG TTTCGCCACC TCTGACTTGA ^GTCG cqgTTCCTGG CCTTTTGCTG 

5161 GCGGAGCCTA TGGAAAAACG CCAGCAACGC JGCCTTTT^ accqtatta c 
5221 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ^ ACCGAGCGCA GCGAGTCAGT 

' %281 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG cxTACGCATC TGTGCGGTAT 

I34I GAGCGAGGAA GCGGAAGAGC ^TGATGCG GTATTTTCTC C ^ GG <pCATTTTT 

5401 TTCACACCGC ATAATTTTGT TAAAATTCGC ^TAAA^ CCGAGATAGG 
5461.TAACCAATAG GCCGAAATCG GCAAAATCCC TTATAAA AC TCCAACGT 
5521 GTTGAGTGTT GTTCCAGTTT GGAACAAGAG ?ff™ CGTGAACCAT CACCCTAATC 
5581 CAAAGGGCGA AAAACCGTCT ^AGGGCGA JGGCCC ^ct^c GGAGCCCCCG 

5641 AAGTTTTTTG GGGTCGAGGT GCCGTAAAGC JCTAAA a g AAA GCGAA 

57 01 ATTTAGAGCT TGACGGGGAA AGCCGGCGAA ^ CTGCGCGTAA CCACCACACC 
5 5761 AGGAGCGGGC GCTAGGGCGC TGGCAAGTGT JGCGGTCACG ? ^ tgctatggtg 
5821 CGCCGCGCTT AATGCGCCGC JACAGGGCGC CAGTATACAC TCCGCTATCG 

58 81 CACTCTCAGT ACAATCTGCT CTGATGCCGC ^ CAC CCGCTGA CGCGCCCTGA 

SSSC ~= TC TGACCGTCTC C—SC 



6 061 ATGTGTCAGA GGTTTTCACC GTCATCACCG 
6121 GCGAAGGCGA AGCGGCATGC ATTTACGTTG 
6181 GTATGGCATG ATAGCGCCCG GAAGAGAGTC 
6241 CGTTATACGA TGTCGCAGAG TATGCCGGTG 

63 01 ACCAGGCCAG CCACGTTTCT GCGAAAACGC 
6361 TGAATTACAT TCCCAACCGC GTGGCACAAC 
6421 GCGTTGCCAC CTCCAGTCTG GCCCTGCACG 

64 81 CTCGCGCCGA TCAACTGGGT GCCAGCGTGG 
6541 AAGCCTGTAA AGC 



AAACGCGCGA GGCAGCAGAT CAATTCGCGC 
ACACCATCGA ATGGTG C AAA ACCTTTCGCG 
AATTCAGGGT GGTGAATGTG AAAC CAGTAA 
TCTCTTATCA GACCGTTTCC CGCGTGGTGA 
GGGAAAAAGT GGAAGCGGCG ATGGCGGAGC 
AACTGGCGGG CAAACAGTCG TTGCTGATTG 
CGCCGTCGCA AATTGTCGCG GCGATTAAAT 
TGGTGTCGAT GGTAGAACGA AGCGGCGTCG 





Figure 2#; -p>tST3 GST fusions in E. coli 
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pDEST3 6823 bp 



Location (Base Nos . ) 



150 . 
1087 
1337 
2116 
2338 
2684 
3231 
5295 



200 
. 963 
. 1996 
.2200 
.2643 
.2808 
.4091 
.6254 



Gene Encoded 

Trc 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

arnpR 

laclq 



1 ACGTTATCGA CTGCACGGTG CACCAATGCT TCTGGCGTCA GGCAGCCATC GGAAGCTGTG 
61 GTATGGCTGT GCAGGTCGTA AATCACTGCA TAATTCGTGT CGCTCAAGGC GCACTCCCGT 
121 TCTGGATAAT GTTTTTTGCG C CGAC AT CAT AACGGTTCTG GCAAATATTC TGAAATGAGC 
181 TGTTGACAAT TAATCATCGG CTCGTATAAT GTGTGGAATT GTGAGCGGAT AACAATTTCA 
241 CACAGGAAAC AGTATTCATG TCCCCTATAC TAGGTTATTG GAAAATTAAG GGCCTTGTGC 
3 01 AACCCACTCG ACTTCTTTTG GAATATCTTG AAGAAAAATA TG AAG AG CAT TTGTATGAGC 

3 61 GCGATGAAGG TGATAAATGG CGAAACAAAA AGTTTGAATT GGGTTTGGAG TTTCCCAATC 
421 TTC CTTATTA TATTGATGGT GATGTTAAAT TAACACAGTC TATGGCCATC ATACGTTATA 

4 81 TAG CTGAC AA GCACAACATG TTGGGTGGTT GTCCAAAAGA GCGTGCAGAG ATTTCAATGC 
541 TTGAAGGAGC GGTTTTGGAT ATTAGATACG GTGTTTCGAG AATTGCATAT AGTAAAGACT 
6 01 TTGAAACTCT CAAAGTTGAT TTTCTTAGCA AGCTACCTGA AATGCTGAAA ATGTTCGAAG 
661 ATCGTTTATG T C AT AAAAC A TATTTAAATG GTGATCATGT AACCCATCCT GACTTCATGT 
721 TGTATG AC G C TCTTGATGTT GTTTTATACA TGGACCCAAT GTGCCTGGAT GCGTTCCCAA 
781 AATTAGTTTG TTTTAAAAAA CGTATTGAAG CTATCCCACA AATTGATAAG TACTTGAAAT 
841 CCAGCAAGTA TAT AG CATGG CCTTTGCAGG GCTGGCAAGC CACGTTTGGT GGTGGCGACC 
901 ATCCTCCAAA ATCGGATCTG GTTCCGCGTG GATCTCGTCG TGCATCTGTT GGATCCCCAT 
961 CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT AT C AATATAT 

1021 TAAATTAGAT TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA CATATC CAGT 
1081 CACTATGGCG GCCGCTAAGT TGGCAGCATC ACCCGACGCA CTTTGCGCCG AATAAATACC 
1141 TGTGACGGAA GATCACTTCG CAGAATAAAT AAATCCTGGT GTCCCTGTTG ATACCGGGAA 
12 01 GCCCTGGGCC AACTTTTGGC GAAAATGAGA CGTTGATCGG C AC GTAAG AG GTTCCAACTT 

12 61 TCACCATAAT GAAATAAGAT CACTACCGGG CGTATTTTTT GAGTTATCGA GATTTTCAGG 
1321 AGCTAAGGAA GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA 

13 81 ATGGCATCGT AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA 

14 41 GACCGTTCAG CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AGCACAAGTT 

15 01 TTATCCGGCC TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTCCGTAT 

15 61 GGCAATGAAA GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT AC AC CGTTTT 
1621 CCATGAGCAA ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA 

16 81 GTTTCTACAC ATATATTCGC AAGATGTGGC GTGTTACGGT G AAAAC CTGG CCTATTTCCC 
1741 TAAAGGGTTT ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTC AC C AG 
18 01 TTTTGATTTA AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA 
18 61 ATATTATACG CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC ATCATGCCGT 

/1921 CTGTGATGGC TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG 
1981 GCAGGGCGGG GCGTAAAGAT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGTATGCGTA 
2 041 TTTGCGCGCT GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC 
2101 AAAAAGAGGT GTGCTATGAA GCAGCGTATT ACAGTGACAG TTGACAGCGA CAGCTATCAG 
2161- TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG 
2221 AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG 
22 81 TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG 
2 341 CAGTTTAAGG TTTACACCTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG 
24 01 AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG 
24 61 CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG 
2 521 CGCATGATGA C C AC CGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AGAAGTGGCT 
2 581 GATCTCAGCC ACCGCGAAAA TG AC AT C AAA AACGCCATTA ACCTGATGTT CTGGGGAATA 
2 641 TAAATGTCAG GCTCCCTTAT AC AC AG C C AG TCTGCAGGTC GAC CATAGTG ACTGGATATG - 



27 01 TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATG C AAA AT C TAATTT A ATATATTGAT 
2761 ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATGGGAATTC 
2821 ATCGTGACTG ACTGACGATC TGCCTCGCGC GTTTCGGTGA TGACGGTGAA AACCTCTGAC 

28 81 ACATGCAGCT CCCGGAGACG GTC AC AG CTT GTCTGTAAGC GGATGCCGGG AGCAGACAAG 

2 941 CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CGCAGCCATG ACCCAGTCAC 

3 0 01 GTAG C GAT AG CGGAGTGTAT AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT 
3 061 TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA 
3121 ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA 
3181 TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC 
3241 AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC 
3301 ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
3361 ACATCGAACT GGATCTCAAC AG C GGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT 
3421 TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 
34 81 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT 
3 541 CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG 
3601 C C AT AAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA 
3661 AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG 
3 721 AACCGGAGCT GAATGAAGCC AT AC C AAAC G ACGAGCGTGA CACCACGATG CCTGCAGCAA 
3781 TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC 
3 841 AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC 

3 901 CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA 
c «s 3 961 TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA 
% 4 021 GTCAGGCAAC TATGGATGAA C G AAAT AG AC AGATCGCTGA GATAGGTGCC TCACTGATTA 

4 081 AG C ATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC 
4141 ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC 

42 01 CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AG AAAAG AT C AAAGGATCTT 
If 4261 CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC 
!!: 4 321 CAGCGGTGGT TTGTTTGCCG GATC AAGAG C T AC C AAC T C T TTTTCCGAAG GTAACTGGCT 

43 81 T C AG C AG AG C G C AG AT AC C A AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT 
4441 TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG 

« 4 5 01 CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAG AC GAT AG TTACCGGATA 

p 4 561 AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA 

yj 4 621 CCTACACCGA ACTGAGATAC CTACAGCGTG AG CTATGAGA AAGCGCCACG CTTCCCGAAG 

Q 46 81 GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG 

Rj 4 741 AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC 

O 4 8 01 TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AAC G C C AG C A 

p 48 61 ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG 

4 921 CGTTATCCCC TGATTCTGTG GATAAC CGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC 

4 981 GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA 

5 041 TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATAAAT TCCGACACCA 
5101 TCGAATGGTG CAAAACCTTT CGCGGTATGG CATGATAGCG CCCGGAAGAG AGTCAATTCA 
5161 GGGTGGTGAA TGTGAAAC C A GTAACGTTAT ACGATGTCGC AGAGTATGCC GGTGTCTCTT 
52 21 ATCAGACCGT TTCCCGCGTG GTGAACCAGG CCAGCCACGT TTCTGCGAAA ACGCGGGAAA 

52 81 AAGTGGAAGC GGCGATGGCG GAGCTGAATT ACATTCCCAA CCGCGTGGCA CAACAACTGG 

53 41 CGGGCAAACA GTCGTTGCTG ATTGGCGTTG CCACCTCCAG TCTGGCCCTG CACGCGCCGT 
'54 01 CGCAAATTGT CGCGGCGATT AAATCTCGCG CCGATCAACT GGGTGCCAGC GTGGTGGTGT 

54 61 C G ATGGT AG A ACGAAGCGGC GTCGAAGCCT GTAAAGCGGC GGTGCACAAT CTTCTCGCGC 

5 521 AACGCGTCAG TGGGCTGATC ATTAAC TATC CGCTGGATGA CCAGGATGCC ATTGCTGTGG 
558 1^ AAGCTGCCTG CACTAATGTT CCGGCGTTAT TTCTTGATGT CTCTGACCAG ACACCCATCA 

56 41 ACAGTATTAT TTTCTCCCAT GAAGACGGTA CGCGACTGGG CGTGGAGCAT CTGGTCGCAT 

57 01 TGGGTCACCA GCAAATCGCG CTGTTAGCGG G C C C ATT AAG TTCTGTCTCG GCGCGTCTGC 

57 61 GTCTGGCTGG C TGG CAT AAA TATCTCACTC GCAATCAAAT TCAGCCGATA GCGGAACGGG 

58 21 AAGGCGACTG GAGTGCCATG TCCGGTTTTC AACAAACCAT GCAAATGCTG AATGAGGGCA 
58 81 TCGTTCCCAC TGCGATGCTG GTTGCCAACG ATCAGATGGC GCTGGGCGCA ATGCGCGCCA 
5941 TTACCGAGTC CGGGCTGCGC GTTGGTGCGG ATATCTCGGT AGTGGGATAC G AC G AT AC CG 

6 001 AAGACAGCTC ATGTTATATC CCGCCGTTAA CCACCATCAA ACAGGATTTT CGCCTGCTGG 
6 061 GG C AAAC C AG CGTGGACCGC TTGCTGCAAC TCTCTCAGGG CCAGGCGGTG AAGGGCAATC 
6121 AGCTGTTGCC CGTCTCACTG GTGAAAAGAA AAACCACCCT GGCGCCCAAT ACGCAAACCG- 



6181 CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG 
6241 AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG 
6 3 01 GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT 
6361 CACACAGGAA ACAGCTATGA CCATGATTAC GGATTCACTG GCCGTCGTTT TACAACGTCG 
64 21 TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC 
64 81 CAGCTGGCGT AATAG CGAAG AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT 
6 541 GAATGGCGAA TGGCGCTTTG CCTGGTTTCC GGCACCAGAA GCGGTGCCGG AAAGCTGGCT 

66 01 GGAGTGCGAT CTTCCTGAGG CCGATACTGT CGTCGTCCCC TCAAACTGGC AGATGCACGG 
6661 TTACGATGCG CCCATCTACA CCAACGTAAC CTATCCCATT ACGGTCAATC CGCCGTTTGT 
6721 TCCCACGGAG AATCCGACGG GTTGTTACTC GCTCACATTT AATGTTGATG AAAGCTGGCT 

67 81 ACAGGAAGGC CAGACGCGAA TTATTTTTGA TGGCGTTGGA ATT 
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His6-thioredoxin fusions in E. coli 
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Gene Encoded 

Trc 

attRl 

CmR 
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ccdB 

attR2 
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ori 

flori (fl intergenic region) 
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1 CTATCCGCTG GATGACCAGG ATGCCATTGC 
61 GTTATTTCTT GATGTCTCTG ACCAGACACC 
121 CGGTACGCGA CTGGGCGTGG AGCATCTGGT 
181 AGCGGGCCCA TTAAGTTCTG TCTCGGCGCG 
241 CACTCGCAAT C AAATTC AG C CGATAGCGGA 
3 01 TTTTCAACAA AC CATGC AAA TGCTGAATGA 
3 61 C AACGAT C AG ATGGCGCTGG GCGCAATGCG 
421 TGCGGATATC TCGGTAGTGG GATACG AC GA 
481 GTCAACCACC ATCAAACAGG ATTTTCGCCT 
541 GCAACTCTCT CAGGGCCAGG CGGTGAAGGG 
6 01 AAGAAAAACC ACCCTGGCAC CCAATACGCA 
6 61 ATTAATG C AG CTGGCACGAC AGGTTTCCCG 
721 TTAATGTGAG TTAGCGCGAA TTGATCTGGT 
781 CCAATGCTTC TGGCGTCAGG CAGCCATCGG 
841 TCACTGCATA ATTCGTGTCG CTCAAGGCGC 
901 G AC AT C AT AA CGGTTCTGGC AAATATTCTG 
961 CCGTATAATC TGTGGAATTG TGAGCGGATA 
1021 CATCATCATC ATCATCACGA TTACGATATC 
1081 GCCCATATGA GCGATAAAAT TATTCACCTG 
1141 AAAGCGGACG GGGCGATCCT CGTCGATTTC 
12 01 ATCGCCCCGA TTCTGGATGA AATCGCTGAC 
12 61 CTGAACATCG ATCAAAACCC TGGCACTGCG 
1321 CTGCTGCTGT TCAAAAACGG TGAAGTGGCG 
1381 CAGTTGAAAG AGTTCCTCGA CGCTAACCTG 
1441 AAGGTACCCA TCACAAGTTT GTACAAAAAA 
1501 ATCAATATAT TAAATTAGAT TTTGCATAAA 
1561 CAT AT C C AGT CACTATGGCG GCCGCTAAGT 
1621 AATAAATACC TGTGACGGAA GATCACTTCG 
16 81 ATACCGGGAA GCCCTGGGCC AACTTTTGGC 
1741 GTTCCAACTT TCACCATAAT GAAATAAGAT 
,18 01 GATTTTCAGG AG C TAAGGAA GCTAAAATGG 
1'8 61 ATATATCCCA ATGGCATCGT AAAGAACATT 
1921 C CTATAAC CA GACCGTTCAG CTGGATATTA 
1981 AGCACAAGTT TTATCCGGCC TTTATTCACA 
2 041- AATTCCGTAT GGCAATGAAA GACGGTGAGC 
2101 AC AC CGTTTT CCATGAGCAA ACTGAAACGT 
2161 ATTTCCGGCA GTTTCTACAC ATATATTCGC 
2221 CCTATTTCCC TAAAGGGTTT ATTGAGAATA 
22 81 GTTTCACCAG TTTTGATTTA AACGTGGCCA 
2341 CCATGGGCAA ATATTATACG CAAGGCGACA 
24 01 ATCATGCCGT CTGTGATGGC TTCCATGTCG 
2461 GCGATGAGTG GCAGGGCGGG GCGTAAACGC 
2 521 AGTATGCGTA TTTGCGCGCT GATTTTTGCG 



TGTGGAAGCT GCCTGCACTA ATGTTCCGGC 
CAT C AAC AGT ATTATTTTCT CCCATGAAGA 
CGCATTGGGT CAC CAGCAAA TCGCGCTGTT 
TCTGCGTCTG GCTGGCTGGC ATAAATATCT 
ACGGGAAGGC GACTGGAGTG CCATGTCCGG 
GGGCATCGTT CCCACTGCGA TGCTGGTTGC 
CGCCATTACC GAGTCCGGGC TGCGCGTTGG 
TAC CGAAGAC AGCTCATGTT ATATCCCGCC 
GCTGGGGCAA ACCAGCGTGG ACCGCTTGCT 
CAATCAGCTG TTGCCCGTCT CACTGGTGAA 
AACCGCCTCT CCCCGCGCGT TGGCCGATTC 
ACTGGAAAGC GGGCAGTGAG CGCAACGCAA 
TTGACAGCTT ATCATCGACT GCACGGTGCA 
AAGCTGTGGT ATGGCTGTGC AGGTCGTAAA 
ACTCCCGTTC TGGATAATGT TTTTTGCGCC 
AAATGAGCTG TTGACAATTA ATCATCCGGT 
ACAATTTCAC ACAGGAAACA GACCATGGGT 
CCAACGACCG AAAACCTGTA TTTTCAGGGC 
ACTGACGACA GTTTTGACAC GGATGTACTC 
TGGGCAGAGT GGTGCGGTCC GTGCAAAATG 
GAATATCAGG GCAAACTGAC CGTTGCAAAA 
CCGAAATATG GCATCCGTGG TATCCCGACT 
GCAACCAAAG TGGGTGCACT GTCTAAAGGT 
GCCGGTTCTG GTTCTGGTGA TGACGATGAC 
GCTGAACGAG AAACGTAAAA TGATATAAAT 
AAACAGACTA CATAATACTG TAAAACACAA 
TGGCAGCATC ACCCGACGCA CTTTGCGCCG 
CAGAATAAAT AAATC CTGGT GTCCCTGTTG 
GAAAATGAGA CGTTGATCGG CACGTAAGAG 
CACTACCGGG CGTATTTTTT GAGTTATCGA 
AGAAAAAAAT CACTGGATAT ACCACCGTTG 
TTGAGGCATT TCAGTCAGTT GCTCAATGTA 
CGGCCTTTTT AAAGAC CGTA AAGAAAAATA 
TTCTTGCCCG CCTGATGAAT GCTCATCCGG 
TGGTGATATG GGATAGTGTT CACCCTTGTT 
TTTCATCGCT CTGGAGTGAA TACCACGACG 
AAGATGTGGC GTGTTACGGT GAAAACCTGG 
TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA 
ATATGGACAA CTTCTTCGCC CCCGTTTTCA 
AGGTGCTGAT GCCGCTGGCG ATTCAGGTTC 
GCAGAATGCT TAATGAATTA CAACAGTACT 
GTGGATCCGG CTTACTAAAA G C C AG AT AAC 
GTATAAGAAT ATATACTGAT ATGTATACCC- 



2 581 GAAGTATGTC AAAAAGAGGT GTGCTATGAA G C AG C GT ATT ACAGTGACAG TTGACAGCGA 
2 641 C AG C TAT C AG TTGCTCAAGG CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC 

27 01 ATGCAGAATG AAGCCCGTCG TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG 
2 761 ATGGCTGAGG TCGCCCGGTT TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC 
2 821 TGGTGAAATG CAGTTTAAGG TTTACACCTA TAAAAGAGAG AG C CGTTATC GTCTGTTTGT 

28 81 GGATGTACAG AGTGATATTA TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG 

2 941 TGCACGTCTG CTGTCAGATA AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA 

3 001 TGAAAGCTGG CGCATGATGA CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA 
3 061 AGAAGTGGCT GATCTCAGCC ACCGCGAAAA TGACATCAAA AACGCCATTA AC CTG ATGTT 
3121 CTGGGGAATA TAAATGTCAG GCTCCCTTAT AC AC AG C C AG TCTGCAGGTC GACCATAGTG 
3181 ACTGGATATG TTGTGTTTTA CAGTATTATG TAGTCTGTTT TTTATG C AAA ATCTAATTTA 
3241 ATATATTGAT ATTTATATCA TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTGA 
33 01 TGGGGATCCT CTAGAGTCGA CCTGCAGTAA TCGTACAGGG TAGTACAAAT AAAAAAGGCA 
3361 CGTCAGATGA CGTGCCTTTT TTCTTGTGAG CAGTAAGCTT GGCTGTTTTG GCGGATGAGA 
3421 GAAGATTTTC AG C CTGATAC AGATTAAATC AGAACGCAGA AGCGGTCTGA TAAAACAGAA 
3481 TTTGCCTGGC GGCAGTAGCG CGGTGGTCCC ACCTGACCCC ATGCCGAACT CAGAAGTGAA 
3541 ACGCCGTAGC GCCGATGGTA GTGTGGGGTC TCCCCATGCG AGAGTAGGGA ACTGCCAGGC 
3 6 01 AT C AAATAAA AC G AAAGGCT CAGTCGAAAG ACTGGGCCTT TCGTTTTATC TGTTGTTTGT 
3 661 CGGTGAACGC TCTCCTGAGT AGGACAAATC CGCCGGGAGC GGATTTGAAC GTTGCGAAGC 
3721 AACGGCCCGG AGGGTGGCGG GCAGGACGCC CGCCATAAAC TGCCAGGCAT CAAATTAAGC 
3781 AG AAGG C CAT CCTGACGGAT GGCCTTTTTG CGTTTCTACA AACTCTTTTT GTTTATTTTT 
3 841 C TAAATAC AT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA 
3 901 ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTTTTT 

3 961 TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC 

4 021 TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT 
4 081 C CTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA AAGTTCTGCT 
4141 ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG CAACTCGGTC GCCGCATACA 
42 01 C TATTCTC AG AATGACTTGG TTGAGTACTC ACCAGTCACA GAAAAGC AT C TTACGGATGG 
4261 CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA 
4321 CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG 
4381 GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA 
4441 CGAGCGTGAC ACCACGATGC CTACAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG 



4501 CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT 
4 561 TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG 
4621 AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC 
4681 CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC GAAATAGACA 
4 741 GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC 
4801 ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT 
4861 CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC 
4 921 AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG 
4 981 CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAG AG C T 
5041 ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG C AG AT AC C AA ATACTGTCCT 
5101 TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT 
5161 CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG 
5221 GTTGGACTCA AGAC G ATAGT TAC CGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC 



5*2 81 GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA 

5341 GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG 

54 01 CAGGGTCGGA AC AGGAGAG C GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA 

54 61-s TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 

5521 GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 

5581 CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAAC CGTAT 

5 641 TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 
5701 AGTGAGCGAG GAAG CGGAAG AGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 

57 61 TATTTCACAC CGCATAATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT 
5821 TTTTAACCAA TAGGCCGAAA TCGGCAAAAT CCCTTATAAA TCAAAAGAAT AGACCGAGAT 

58 81 AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA TTAAAGAACG TGGACTCCAA 
5941 CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCTA 

6 001 ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC - 




6 061 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC 
6121 GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC 
6181 ACCCGCCGCG CTTAATGCGC CGCTACAGGG CGCGTCCATT CGCCATTCAG GCTGCTATGG 
6241 TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT 

63 01 CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT 
6361 GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT 
6421 GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCAG ATCAATTCGC 

64 81 GCGCGAAGGC GAAGCGGCAT GCATTTACGT TGACACCATC G AATGGTG C A AAACCTTTCG 
6541 CGGTATGGCA TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT 
6601 AACGTTATAC GATGTCGCAG AGTATGCCGG TGTCTCTTAT C AG AC C GTTT CCCGCGTGGT 
66 61 GAACCAGGCC AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA 
6721 GCTGAATTAC ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT 
6781 TGGCGTTGCC ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA 
6841 ATCTCGCGCC GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT 
6 9 01 CGAAGCCTGT AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGCGTCAGTN GGGCTGATCA 
6961 TTAA 
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Location (Base Nos . ) 
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Gene Encoded 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

fl (fl intergenic region) 

ampR 

ori 

lad 



1 AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA ATTGTGAGCG 
61 GATAACAATT TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCTC TAATACGACT 
121 CACTATAGGG AAAGCTGGTA CGCCTGCAGG TACCGGTCCG GAATTCCCGG GTCGACGATC 
181 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA 
241 AATTAGATTT TGCATAAAAA ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA 
3 01 CTATGGCGGC CGCTAAGTTG G C AG CATC AC CCGACGCACT TTGCGCCGAA TAAATAC CTG 
3 61 TGACGGAAGA TCACTTCGCA GAATAAATAA ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC 
421 CCTGGGCCAA CTTTTGGCGA AAATGAGACG TTGATCGGCA CGTAAGAGGT TCCAACTTTC 
f** 481 AC C ATAATGA AATAAGATCA CTACCGGGCG TATTTTTTGA GTTATCGAGA TTTTCAGGAG 

Hi 541 CTAAGGAAG C TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT 

?£* 6 01 GGCATCGTAA AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA 

TV 661 CCGTTCAGCT GGATATTACG GCCTTTTTAA AG AC C GTAAA GAAAAATAAG CACAAGTTTT 

JT" 721 ATCCGGCCTT TATTCACATT CTTGCCCGCC TGATGAATGC TCATCCGGAA TTC CGTATGG 

^ 781 CAATGAAAGA CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC 

21 841 ATGAGCAAAC TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT 

0j 901 TTCTACACAT ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA 

^ 961 AAGGGTTTAT TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT 

1 1021 TTGATTTAAA CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT 

O 1081 ATTATACGCA AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT 

bj 1141 GTGATGGCTT CCATGTCGGC AGAATG CTTA ATGAATTACA ACAGTACTGC GATGAGTGGC 

□ 12 01 AGGGCGGGGC GTAAACGCGT GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT 

fy 12 61 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA 

p 13 21 AAAG AG GTGT GCTATGAAGC AG CGTATTAC AG TG AC AGTT GACAGCGACA GCTATCAGTT 

ffi 13 81 GCTCAAGGCA TATATGATGT CAATATCTCC GGTCTGGTAA G C AC AAC CAT GCAGAATGAA 

14 41 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC 

15 01 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA 
1561 GTTTAAGGTT TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG 

16 21 TGATATTATT GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT 
1681 GTCAGATAAA GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG 
1741 CATGATGACC AC CG ATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA 
18 01 TCTCAGCCAC CGCGAAAATG AC AT C AAAAA CGCCATTAAC CTGATGTTCT GGG GAATATA 
18 61 AATGTCAGGC TCCCTTATAC ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT 
1'921 GTGTTTTACA GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT 
1981 TTATATCATT TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTGATC ACTAGTCGGC 
2 041 GGCCGCTCTA GAGGATCCAA GCTTACGTAC GCGTGCATGC GACGTCATAG CTCTTCTATA 
2 1 0 1 - .GTGTCACCTA AATTCAATTC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 
2161 GGCGTTACCC AACTTAATCG CCTTGCAGCA CATCCCCCTT TCGCCAGCTG GCGTAATAGC 
2221 GAAGAGGCCC GCACCGATCG CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGACG 
2281 CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA 
2341 CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT 
24 01 TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC CGATTTAGTG 
24 61 CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT AGTGGGCCAT 
2 521 CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT AATAGTGGAC 
2 581 TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT GATTTATAAG- 



2 641 GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA AAATTTAACG 
2 701 CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG GGAAATGTGC 
2761 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC 
2821 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT 
2881 TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG 

2 941 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG 

3 001 AACTGGATCT CAACAGCGGT AAGATC CTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA 
3 061 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC 
3121 AAG AG CAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TAC T C AC C AG 
3181 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA 
3 241 C C ATGAGTG A TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC 
3 301 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG 
3361 AG CTGAATGA AG C CAT AC C A AACGACGAGC GTG AC AC C AC GATGCCTGTA GCAATGGCAA 
3421 CAACGTTGCG CAAACTATTA ACTGGCGAAC TAC TTACTCT AGCTTCCCGG CAACAATTAA 
34 81 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG 
3 541 GCTGGTTTAT T G C TG AT AAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG 
3601 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG 
3661 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT 
3 721 GGTAACTGTC AG AC C AAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT 
3 781 AATTTAAAAG GATCTAGGTG AAGATC CTTT TTGATAATCT C ATG AC C AAA AT C C CTTAAC 
3 841 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA TCTTCTTGAG 
3 901 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT T G C AAAC AAA AAAACCACCG CTACCAGCGG 

3 961 TGGTTTGTTT GCCGGATCAA GAG CT AC C AA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA 

4 021 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA 
4 081 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA 
4141 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC 
42 01 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA 
4261 CCGAACTGAG AT AC CTACAG C GTG AG C ATT GAGAAAGCGC CACGCTTCCC GAAG GGAGAA 
4 321 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC 
4 381 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TG ACTTGAG C 
4441 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG 

45 01 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT 
4 561 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AG CTGATAC C GCTCGCCGCA 
4 621 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA 

46 81 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATT AATG C AG AGCTTGCAAT TCGCGCGCGA 
4741 AGGCGAAGCG GCATTTACGT TG AC AC CAT C GAATGGCGCA AAACCTTTCG CGGTATGGCA 
4 8 01 TGATAGCGCC CGGAAGAGAG TCAATTCAGG GTGGTGAATG TGAAACCAGT AACGTTATAC 
4861 GATGTCGCAG AGTATGCCGG TGTCTCTTAT CAGACCGTTT CCCGCGTGGT GAACCAGGCC 
4 921 AGCCACGTTT CTGCGAAAAC GCGGGAAAAA GTGGAAGCGG CGATGGCGGA GCTGAATTAC 

4 981 ATTCCCAACC GCGTGGCACA ACAACTGGCG GGCAAACAGT CGTTGCTGAT TGGCGTTGCC 

5 041 ACCTCCAGTC TGGCCCTGCA CGCGCCGTCG CAAATTGTCG CGGCGATTAA ATCTCGCGCC 
5101 GATCAACTGG GTGCCAGCGT GGTGGTGTCG ATGGTAGAAC GAAGCGGCGT CGAAGCCTGT 
5161 AAAGCGGCGG TGCACAATCT TCTCGCGCAA CGGGTCAGTG GGCTGATCAT TAACTATCCG 
5221 CTGGATGACC AGG ATG C CAT TGCTGTGGAA GCTGCCTGCA CTAATGTTCC GGCGTTATTT 
52 81 CTTGATGTCT CTGACCAGAC ACCCATCAAC AGTATTATTT TCTCCCATGA AG AC G GT AC G 

/ 5'3 41 CGACTGGGCG TGGAGCATCT GGTCGCATTG GGTCACCAGC AAATCGCGCT GTTAGCGGGC 
54 01 C C ATT AAG TT CTGTCTCGGC GCGTCTGCGT CTGGCTGGCT GGCATAAATA TCTCACTCGC 

54 61 AATCAAATTC AG C CGATAG C GGAACGGGAA GGCGACTGGA GTGCCATGTC CGGTTTTCAA 
5 521, CAAACCATGC AAATGCTGAA TGAGGG CATC GTTCCCACTG CGATGCTGGT TG C C AAC GAT 

55 81 CAGATGGCGC TGGGCGCAAT GCGCGCCATT ACCGAGTCCG GGCTGCGCGT TGGTGCGGAT 

56 41 ATCTCGGTAG TGGGATACGA CGATAC CGAA GACAGCTCAT GTTATATCCC GCCGTCAACC 

57 01 ACCATCAAAC AGGATTTTCG CCTGCTGGGG C AAAC C AG CG TGGACCGCTT GCTGCAACTC 

57 61 TCTCAGGGCC AGGCGGTGAA GGGCAATCAG CTGTTGCCCG TCTCACTGGT GAAAAGAAAA 
5821 ACCACCCTGG CGCCCAATAC GCAAACCGCC TCTCCCCGCG CGTTGGCCGA TTCATTAATG 

58 81 CAGCTGGCAC GACAGGTTTC CCGACTGGAA AGCGGGCAGT GAGCGCAACG CAATTAATGT 
5 941 GAGTTAG CT C ACTCATT 
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pDEST6 5957 bp 

Location (Rase Nos . ) 
266. .142 
516 . . 1175 
1295 . . 1379 
1517 . . 1822 
1863 . .1987 
2203 . . 3369 
4403 . . 5260 
5392 . . 5847 

1 TAACGCCAGG GTTTTCCCAG TCACGACGTT GTAAAACGAC GGCCAGTGAA TTGAATTTAG 
61 GTGACACTAT AGAAGAGCTA TGACGTCGCA TGCACGCGTA CGTAAGCTTG GATCCTCTAG 
121 AGCGGCCGCC GACTAGTGAT CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT 
181 GATATAAATA TCAATATATT AAATTAGATT TTGCATAAAA AACAGACTAC ATAATACTGT 
241 AAAACACAAC ATATCCAGTC ACTATGGCGG CCGCTAAGTT GGCAGCATCA CCCGACGCAC 
3 01 TTTGCGCCGA ATAAATACCT GTGACGGAAG ATCACTTCGC AGAATAAATA AATCCTGGTG 
361 TCCCTGTTGA TACCGGGAAG CCCTGGGCCA ACTTTTGGCG AAAATGAGAC GTTGATCGGC 
421 ACGTAAGAGG TTCCAACTTT C AC C ATAATG AAATAAGATC ACTACCGGGC GTATTTTTTG 
481 AGTTAT C GAG ATTTTCAGGA GCTAAGGAAG CTAAAATGGA GAAAAAAATC ACTGGATATA 
541 CCACCGTTGA TATATCCCAA TGGCATCGTA AAGAACATTT TGAGGCATTT CAGTCAGTTG 
6 01 CTCAATGTAC CTATAAC C AG ACCGTTCAGC TGGATATTAC GGCCTTTTTA AAGACCGTAA 
661 AGAAAAATAA GCACAAGTTT TATCCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG 
721 CTCATCCGGA ATTCCGTATG GCAATGAAAG ACGGTG AG C T GGTGATATGG GATAGTGTTC 
781 ACCCTTGTTA CACCGTTTTC C ATG AG C AAA CTGAAACGTT TTCATCGCTC TGGAGTGAAT 
841 ACCACGACGA TTTCCGGCAG TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG 
9 01 AAAACCTGGC CTATTTCCCT AAAGGGTTTA TTGAGAATAT GTTTTTCGTC TCAGCCAATC 
961 CCTGGGTGAG TTTCACCAGT TTTGATTTAA ACGTGGCCAA TATGGACAAC TTCTTCGCCC 
1021 CCGTTTTCAC CATGGGCAAA TATTATACGC AAGGCGACAA GGTGCTGATG CCGCTGGCGA 
1081 TTCAGGTTCA TCATGCCGTC TGTGATGGCT TCCATGTCGG CAGAATGCTT AATGAATTAC 
1141 AACAGTACTG CGATGAGTGG CAGGGCGGGG CGTAAACGCG TGGATCCGGC TTACTAAAAG 
12 01 C C AG AT AAC A GTATGCGTAT TTGCGCGCTG ATTTTTGCGG TATAAGAATA TAT AC TG AT A 

12 61 TGTATACCCG AAGTATGTCA AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT 
1321 TG AC AG C G AC AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA 

13 81 AGCACAAC C A TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AGCGGAAAAT 

14 41 CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG 
1501 AACAGGGACT GGTGAAATGC AGTTTAAGGT TTACACCTAT AAAAGAGAGA GCCGTTATCG 
1561 TCTGTTTGTG GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC 
1621 CCTGGCCAGT GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA 
16 81 TATCGGGGAT GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT 
1741 TATCGGGGAA GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA 
18 01 CCTGATGTTC TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG 
18 61 ACCATAGTGA CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA 
1921 TCTAATTTAA TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA 
1981 AAGTGGTGAT CGTCGACCCG GGAATTCCGG ACCGGTACCT GCAGGCGTAC CAGCTTTCCC 
2 041 TATAGTGAGT CGTATTAGAG CTTGGCGTAA TCATGGTCAT AGCTGTTTCC TGTGTGAAAT 
2101 TGTTATCCGC TCACAATTCC ACACAACATA CGAGCCGGAA GCATAAAGTG TAAAGCCTGG 
2 1 6 1 5 GGTGCCTAAT GAGTGAGCTA ACTCACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG 
2221 TCGGGAAACC TGTCGTGCCA GCTGCATTAA TGAATCGGCC AACGCGCGGG GAGAGGCGGT 
22 81 TTGCGTATTG GGCGCCAGGG TGGTTTTTCT TTTCACCAGT GAGACGGGCA ACAGCTGATT 
2 341 GCCCTTCACC GCCTGGCCCT GAGAGAGTTG CAGCAAGCGG TCCACGCTGG TTTGCCCCAG 
24 01 CAGGCGAAAA TCCTGTTTGA TGGTGGTTGA CGGCGGGATA TAACATGAGC TGTCTTCGGT 
24 61 ATCGTCGTAT CCCACTACCG AGATATCCGC ACCAACGCGC AGCCCGGACT CGGTAATGGC 
2 521 GCGCATTGCG CCCAGCGCCA TCTGATCGTT GG C AAC C AG C ATCGCAGTGG GAACGATGCC 
2 581 CTCATTCAGC ATTTGCATGG TTTGTTGAAA ACCGGACATG GCACTCCAGT CGCCTTCCCG 
2641 TTCCGCTATC GGCTGAATTT GATTGCGAGT GAGATATTTA TGCCAGCCAG C C AG AC G C AG - 
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27 01 ACGCGCCGAG ACAGAACTTA ATGGGCCCGC TAACAGCGCG ATTTGCTGGT GACCCAATGC 
2761 GAC CAGATGC TCCACGCCCA GTCGCGTACC GTCTTCATGG GAGAAAATAA TACTGTTGAT 
2 821 GGGTGTCTGG TCAGAGACAT CAAGAAATAA CGCCGGAACA TTAGTGCAGG CAGCTTCCAC 
2881 AGCAATGGCA TCCTGGTCAT CCAGCGGATA GTTAATGATC AGCCCACTGA CCCGTTGCGC 

2 941 GAGAAGATTG TGCACCGCCG CTTTACAGGC TTCGACGCCG CTTCGTTCTA CCATCGACAC 

3 0 01 CACCACGCTG GCACCCAGTT GATCGGCGCG AGATTTAATC GCCGCGACAA TTTGCGACGG 
3061 CGCGTGCAGG GCCAGACTGG AGGTGGCAAC GCCAATCAGC AACGACTGTT TGCCCGCCAG 
3121 TTGTTGTGCC ACGCGGTTGG GAATGTAATT CAGCTCCGCC ATCGCCGCTT CCACTTTTTC 
3181 CCGCGTTTTC GCAGAAACGT GGCTGGCCTG GTTCACCACG CGGGAAACGG TCTGATAAGA 
3241 GACACCGGCA TACTCTGCGA CAT C GTATAA CGTTACTGGT TTCACATTCA CCACCCTGAA 

33 01 TTGACTCTCT TCCGGGCGCT ATCATGCCAT ACCGCGAAAG GTTTTGCGCC ATTCGATGGT 
3361 GTCAACGTAA ATGCCGCTTC GCCTTCGCGC GCGAATTGCA AGCTCTGCAT TAATGAATCG 

34 21 GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 
34 81 ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA 
3 541 TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC 

36 01 AAAAGG C C AG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 
3661 CTGACGAGCA T C AC AAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT 
3721 AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 

37 81 CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCAATGCT 
3841 CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG 

3 9 01 AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC 
3961 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 

4 021 GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC TACACTAGAA 
4081 GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA 
4141 GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC 
42 01 AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG 
42 61 ACGCTCAGTG G AAC G AAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA 
4321 TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA AGTATATATG 
4381 AGTAAACTTG GTCTGACAGT TAC CAATGCT TAATCAGTGA GGCACCTATC TCAGCGATCT 
44 41 GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT ACGATACGGG 
4 501 AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC TCACCGGCTC 
4 561 CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA 
4 621 CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC 
4681 CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT 
4 741 CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC 
48 01 CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC AGAAGTAAGT 
4861 TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT ACTGTCATGC 
4 921 CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC TGAGAATAGT 

4 981 GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC GCGCCACATA 

5 041 GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA 
5101 TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC TGATCTTCAG 
5161 CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA AATGCCGCAA 
5221 AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT TTTCAATATT 
52 81 ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA 
5341 AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GAAATTGTAA 

'54 01 ACGTTAATAT TTTGTTAAAA TTCGCGTTAA ATTTTTGTTA AATCAGCTCA TTTTTTAACC 
54 61 AATAGGCCGA AATCGGCAAA ATCCCTTATA AATCAAAAGA ATAGACCGAG ATAGGGTTGA 
5 521 GTGTTGTTCC AGTTTGGAAC AAGAGTCCAC TATTAAAGAA CGTGGACTCC AACGTCAAAG 
5581^ GGCGAAAAAC CGTCTATCAG GGCGATGGCC CACTACGTGA ACCATCACCC TAATCAAGTT 
5641 TTTTGGGGTC GAGGTGCCGT AAAGCACTAA ATCGGAACCC TAAAGGGAGC CCCCGATTTA 
57 01 GAGCTTGACG GGGAAAGCCG GCGAACGTGG CGAGAAAGGA AGGGAAGAAA GCGAAAGGAG 
57 61 CGGGCGCTAG GGCGCTGGCA AGTGTAGCGG TCACGCTGCG CGTAACCACC ACACCCGCCG 
5821 CGCTTAATGC GCCGCTACAG GGCGCGTCCA TTCGCCATTC AGGCTGCGCA ACTGTTGGGA 
5881 AGGGCGATCG GTGCGGGCCT CTTCGCTATT ACGCCAGCTG GCGAAAGGGG GATGTGCTGC 
5941 AAGGCGATTA AGTTGGG 
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pDEST7 6025 bp (rotated to position 2800) 



Location (Base Nos . ) 
67 . . 589 
906 . .782 



1015 . . 1674 
1794 . . 1878 
2016 . .2321 
2362 . .2486 
2671 . .3033 
3227 . .3502 
3962 . .4822 
5022 . . 5661 



Gene Encoded 
CMV promoter 
attRl 
CmR 

inactivated ccdA 

ccdB 

attR2 

small t & polyA 
f 1 

ampR 



on 



1 ATTATCATGA CATTAACCTA TAAAAATAGG CGTAGTACGA GGCCCTTTCA CTCATTAGAT 
61 GCATGTCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
121 CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG 
181 ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA 
241 TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC 
3 01 CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC AT CTACGTAT TAGTCATCGC 
361 TATTACCATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 
421 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA 
481 TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG 
541 GCGTGTACGG TGGGAGGTCT AT AT AAG C AG AGCTCGTTTA GTGAACCGTC AGATCGCCTG 
6 01 GAGACGCCAT CCACGCTGTT TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG 
661 GACTCTAGCC TAGGCCGCGG AGCGGATAAC AATTTCACAC AGGAAACAGC TATGACCATT 
721 AGGCCTTTGC AAAAAGCTAT TTAGGTGACA CTATAGAAGG TACGCCTGCA GGTACCGGAT 
781 CACAAGTTTG TACAAAAAAG CTGAACGAGA AACGTAAAAT GATATAAATA TCAATATATT 
841 AAATTAGATT TTGCATAAAA AAC AG AC TAC ATAATACTGT AAAACACAAC ATATCCAGTC 
9 01 ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACAC TTT ATGCTTCCGG CTCGTATAAT 
961 GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTT C AG GAG CTAAGG AAG C TAAAATGGAG 
1021 AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA AGAACATTTT 
1081 GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG 
114 1 GCCTTTTTAA AGAC CGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT 
12 01 CTTGCCCGCC TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG 

12 61 GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC TGAAACGTTT 
1321 TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT ATATTCGCAA 

13 81 GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG 

14 41 TTTTTC GTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT 

15 01 ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG 
1561 GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC 
1621 AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC GTAAACGCGT 

16 81 GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA TTTTTGCGGT 
1741 ATAAGAATAT ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC 
18 01 AGCGTATTAC AGTGACAGTT GAC AG CGACA GCTATCAGTT GCTCAAGGCA TATATGATGT 
18 61 CAATATCTCC GGTCTGGTAA G C AC AAC CAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA 
1921 ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA 
1981 CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA 
2 041: AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATG TAC AG AG TGATATTATT GACACGCCCG 
2101 GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GT C AG AT AAA GTCTCCCGTG 
2161 AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC AC C GATATGG 
22 21 CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCGAAAATG 
2 281 AC AT C AAAAA C G C C ATT AAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC 
2 341 ACAGCCAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA GTATTATGTA 
24 01 GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT TTACGTTTCT 
24 61 CGTTCAGCTT TCTTGTACAA AGTGGTGATC GCGTGCATGC GACGTCATAG CTCTCTCCCT 
2 521 ATAGTGAGTC GTATTATAAG CTAGGCACTG GCCGTCGTTT TACAACGTCG TG ACTGGGAA - 



2 581 AACTGCTAGC 

2 641 AAACTACCTA 

27 01 GTTAAAC TAG 

2 761 AAATATTATA 

2 821 AAGGCTCATT 

2 881 ACATTTGTAG 

2 941 CATAAAATGA 

3 001 TAAAGCAATA 
3 061 GGTTTGTCCA 
3121 CGGCCAACGC 
3181 CACCGATCGC 
3241 CGGCGCATTA 
33 01 CGCCCTAGCG 
3361 TCCCCGTCAA 
3421 CCTCGACCCC 
3481 GACGGTTTTT 
3 541 AACTGGAACA 
3 6 01 GATTTCGGCC 
3661 CAAAATATTA 
3 721 TATTTGTTTA 
3781 GGTGAGAACG 
3 841 TGTGCGATAG 
3 9 01 ATGTGTGCCC 

3 961 AAGGAAGAGT 

4 021 TTGCCTTCCT 
4 081 GTTGGGTGCA 
4141 TTTTCGCCCC 
42 01 GGTATTATCC 
42 61 GAATGACTTG 
4321 AAGAGAATTA 
4381 GACAACGATC 
4441 AACTCGCCTT 
4501 CACCACGATG 
4 561 TACTCTAGCT 
4 621 ACTTCTGCGC 
4681 GCGTGGGTCT 
4 741 AGTTATCTAC 
48 01 GATAGGTGCC 
4861 TTAGATTGAT 
4 921 TAATCTCATG 

4 981 CCCTTAACGT 

5 041 TTCTTGAGAT 
5101 ACCAGCGGTG 
5161 CTTCAGCAGA 
52 21 CTTCAAGAAC 
5'2 81 TGCTGCCAGT 
5341 TAAGGCGCAG 
54 01 GACCTACACC 
54 6 1* AGGGAGAAAG 
5521 GGAGCTTCCA 
5581 ACTTGAGCGT 
5641 CAACGCGGCC 
57 01 TGCGTTATCC 

57 61 TCGCCGCAGC 
5821 AATAC GC AAA 

58 81 GCGCGTTTTT 

5 941 ATTTGAATGT 

6 001 GCCACCTGAC 



TTGGGATCTT 
CAGAGATTTA 
CTGCATATGC 
CACAGGAGCT 
TCAGGCCCCT 
AGGTTTTACT 
ATGCAATTGT 
G C ATCAC AAA 
AACTCATCAA 
GCGGGGAGAG 
CCTTCCCAAC 
AGCGCGGCGG 
CCCGCTCCTT 
GCTCTAAATC 
AAAAAACTTG 
CGCCCTTTGA 
ACACTCAACC 
TATTGGTTAA 
ACGTTTACAA 
TTTTTCTAAA 
GCTTGCTCGG 
AGGGAAGTCG 
ACCCCTGGCA 
ATGAGTATTC 
GTTTTTGCTC 
CGAGTGGGTT 
GAAGAACGTT 
CGTATTGACG 
GTTGAGTACT 
TGCAGTGCTG 
GGAGGACCGA 
GATCGTTGGG 
CCTGTAGCAA 
TCCCGGCAAC 
TCGGCCCTTC 
CGCGGTATCA 
ACGACGGGGA 
TCACTGATTA 
TTAAAACTTC 
CCATAACTTC 
GAGTTTTCGT 
CCTTTTTTTC 
GTTTGTTTGC 
GCGCAGATAC 
TCTGTAGCAC 
GGCGATAAGT 
CGGTCGGGCT 
GAACTGAGAT 
GCGGACAGGT 
GGGGGAAACG 
CGATTTTTGT 
TTTTTACGGT 
CCTGATTCTG 
CGAACGACCG 
CCGCCTCTCC 
CAATATTATT 
ATTTAGAAAA 
GTCTAAGAAA 



TGTGAAGGAA 
AAGCTCTAAG 
TTGCTGCTTG 
AGTGATTCTA 
CAGTCCTCAC 
TGCTTTAAAA 
TGTTGTTAAC 
TTTCACAAAT 
TGTATCTTAT 
GCGGTTTGCG 
AGTTGCGCAG 
GTGTGGTGGT 
TCGCTTTCTT 
GGGGGCTCCC 
ATTAGGGTGA 
CGTTGGAGTC 
CTATCTCGGT 
AAAATGAGCT 
TTTCAGGTGG 
TACATTCAAA 
CAGCTTCGAT 
CATTGAATTA 
TGAGACAATA 
AACATTTCCG 
ACCCAGAAAC 
ACATCGAACT 
TTCCAATGAT 
CCGGGCAAGA 
CACCAGTCAC 
C C AT AAC CAT 
AGGAGCTAAC 
AACCGGAGCT 
TGGCAACAAC 
AATTAATAGA 
CGGCTGGCTG 
TTGCAGCACT 
GTCAGGCAAC 
AGCATTGGTA 
ATTTTTAATT 
GTATAATGTA 
TCCACTGAGC 
TGCGCGTAAT 
CGGATCAAGA 
CAAATACTGT 
CGCCTACATA 
CGTGTCTTAC 
GAACGGGGGG 
ACCTACAGCG 
ATCCGGTAAG 
CCTGGTATCT 
GATGCTCGTC 
TCCTGGCCTT 
TGGATAACCG 
AGCGCAGCGA 
CCGCGCGTTG 
GAAGCATTTA 
ATAAACAAAT 
CCATT 



CCTTACTTCT 
GTAAATATAA 
AGAGTTTTGC 
ATTGTTTGTG 
AGTCTGTTCA 
AACCTCCCAC 
TTGTTTATTG 
AAAGCATTTT 
CATGTCTGGA 
TATTGGCTGG 
CCTGAATGGC 
TACGCGCAGC 
CCCTTCCTTT 
TTTAGGGTTC 
TGGTTCACGT 
CACGTTCTTT 
CTATTCTTTT 
GATTTAACAA 
CACTTTTCGG 
TATGTATCCG 
GTGTGCTGGA 
TGTGCTGTGT 
ACCCTGATAA 
TGTCGCCCTT 
GCTGGTGAAA 
GGATCTCAAC 
GAGCACTTTT 
GCAACTCGGT 
AGAAAAGCAT 
GAGTGATAAC 
CGCTTTTTTG 
GAATGAAGCC 
GTTGCGCAAA 
CTGGATGGAG 
GTTTATTGCT 
GGGGCCAGAT 
TATGGATGAA 
ACTGTCAGAC 
TAAAAGGATC 
TGCTATACGA 
GTCAGACCCC 
CTGCTGCTTG 
GCTACCAACT 
CCTTCTAGTG 
CCTCGCTCTG 
CGGGTTGGAC 
TTCGTGCACA 
TGAGCATTGA 
CGGCAGGGTC 
TTATAGTCCT 
AGGGGGGCGG 
TTGCTGGCCT 
TATTACCGCC 
GTCAGTGAGC 
GCCGATTCAT 
TCAGGGTTAT 
AGGGGTTCCG 



GTGGTGTGAC 
AATTTTTAAG 
TTACTGAGTA 
TATTTTAGAT 
TGATCATAAT 
ACCTCCCCCT 
CAG CTTATAA 
TTTCACTGCA 
TCGATCCTGC 
CGTAATAGCG 
GAATGGGACG 
GTGACCGCTA 
CTCGCCACGT 
CGATTTAGTG 
AGTGGGCCAT 
AATAGTGGAC 
GATTTATAAG 
AAATTTAACG 
GGAAATGTGC 
CTCATGCCAG 
GGGAGAATAA 
AGGGATCGCT 
ATGCTTCAAT 
ATTCCCTTTT 
G TAAAAG ATG 
AG C GGTAAG A 
AAAGTTCTGC 
CGCCGCATAC 
CTTACGGATG 
ACTGCGGCCA 
CACAACATGG 
ATACCAAACG 
CTATTAACTG 
GCGGATAAAG 
GATAAATCTG 
GGTAAG CCCT 
C GAAAT AG AC 
CAAGTTTACT 
TAGGTGAAGA 
AGTTATGGCA 
GTAGAAAAGA 
CAAACAAAAA 
CTTTTTCCGA 
TAG C C GTAGT 
CTAATCCTGT 
TCAAGACGAT 
CAGCCCAGCT 
GAAAGCGCCA 
GGAACAGGAG 
GTCGGGTTTC 
AGCCTATGGA 
TTTGCTCACA 
TTTGAGTGAG 
GAGGAAGCGG 
TAATGCAGAG 
TGTCTCATGA 
CGCACATTTC 



ATAATTGGAC 
TGTATAATGT 
TGATTTATGA 
TCACAGTCCC 
CAGCCATACC 
GAACCTGAAA 
TGGTTACAAA 
TTCTAGTTGT 
ATTAATGAAT 
AAGAGGCCCG 
CGCCCTGTAG 
CACTTGCCAG 
TCGCCGGCTT 
CTTTACGGCA 
CGCCCTGATA 
TCTTGTTCCA 
GGATTTTGCC 
CGAATTTTAA 
GCGGAACCCC 
GTCTTGGACT 
AGGTCTAAGA 
GGTATCAAAT 
AATATTGAAA 
TTGCGGCATT 
CTGAAGATCA 
TCCTTGAGAG 
TATGTGGCGC 
ACTATTCTCA 
GC ATGACAGT 
ACTTACTTCT 
GGGATCATGT 
ACGAGCGTGA 
GCGAACTACT 
TTGCAGGACC 
GAGCCGGTGA 
CCCGTATCGT 
AGATCGCTGA 
CATATATACT 
TCCTTTTTGA 
TG AC C AAAAT 
TCAAAGGATC 
AACCACCGCT 
AGGTAACTGG 
TAGGCCACCA 
TAC CAGTGGC 
AGTTACCGGA 
TGGAGCGAAC 
CGCTTCCCGA 
AGCGCACGAG 
GCCACCTCTG 
AAAACGCCAG 
TGTTCTTTCC 
CTGATACCGC 
AAGAGCGCCC 
CTTGCAATTC 
GCGGATACAT 
CCCGAAAAGT 



Figure pDEST8 Polyhedron Promoter, Baculo virus 

Transfer Plasmid 



1 cgtUta etc egg aat att aat aga tea tgg aga taa tta aaa tga taa cca 
gca~t^ gag gee tta taa tta tct agt acc tct att aat ttt act att act 

52 tct cgc aaa taa ata lagt att tta 7 ctg ttt teg taa cag ttt tgt aat aaa 
aga gcg ttt att tat tea taa aat gac aaa age att gtc aaa aca tta ttt 

103 aaa acc tat aaa tat tec gga tta ttc ata ccg tec cac cat egg gcg cSo 
(tdm) ttfc a9Sf ° Ct ^ aag tat " C a " 9tg 9ta gcc cgc 



154 Hf ilZ fcf agC c 4a^?£l*^ a ^^aa y ega oaa apg taa^Tat >ga ty Sta 
tag tag fcgt tea aa c 1 atq ttt^tb ^T coi c^rt^^r^ at^ta^t j/?^ 



SnaB I 6525 




SsaH I 3481 





pDEST8 6526 bp 

T,nr a t-inn (Rase Hos.) Gene Encoded 

23.. 152 Ppolh 

284 . . 160 attRl 

534 . .1193 CmR 

1313.. 1397 inactivated ccdA 

1535 . . 1840 ccdB 

1881 . .2005 attR2 

2766 . .3146 f 1 

3240. .4090 ampR 

4289 . .4869 ori 

5564 . .6496 genR 

1 CGTATACTCC GGAATATTAA TAGATCATGG AGATAATTAA AATGATAACC ATCTCGCAAA 
61 TAAATAAGTA TTTTACTGTT TTCGTAACAG TTTTGTAATA AAAAAACCTA TAAAT ATT C C 
121 GGATTATTCA TACCGTCCCA CCATCGGGCG CGGATCATCA CAAGTTTGTA CAAAAAAGCT 
181 GAACGAGAAA CGTAAAATGA TATAAATATC AATATATTAA ATTAGATTTT GCATAAAAAA 
241 CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC TATGGCGGCC GCTAAGTTGG 
3 01 CAGCATCACC CGACGCACTT TGCGCCGAAT AAATACCTGT GACGGAAGAT CACTTCGCAG 
361 AATAAATAAA TCCTGGTGTC C CTGTTG AT A CCGGGAAGCC CTGGGCCAAC TTTTGGCGAA 
421 AATGAGACGT TGATCGGCAC GTAAGAGGTT CCAACTTTCA CCATAATGAA ATAAGATCAC 
481 TACCGGGCGT ATTTTTTGAG TTATC GAG AT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA 
541 AAAAAATCAC TGGATATACC ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG 
6 01 AGGCATTTCA GTCAGTTGCT CAATGTACCT ATAAC C AG AC CGTTCAGCTG GATATTACGG 
661 CCTTTTTAAA GACCGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC 
721 TTGCCCGCCT GATGAATGCT CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG 
781 TGATATGGGA TAGTGTTCAC CCTTGTTACA CCGTTTTCCA TGAGC AAAC T GAAACGTTTT 
841 CATCGCTCTG GAGTGAATAC CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG 
9 01 ATGTGGCGTG TTACGGTGAA AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT 
961 TTTTCGTCTC AGCCAATCCC TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA 
1021 TGGACAACTT CTTCGCCCCC GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG 
1081 TGCTGATGCC GCTGGCGATT CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA 
1141 GAATGCTTAA TGAATTACAA CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG 
12 01 GATCCGGCTT ACTAAAAG C C AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA 
12 61 TAAGAATATA TACTGATATG TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA 
1321 G CGTATTAC A GTGACAGTTG ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC 
1381 AATATCTCCG GTCTGGTAAG CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA 
1441 CGCTGGAAAG CGGAAAATCA GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC 
1501 GGCTCTTTTG CTGACGAGAA CAGGGACTGG TGAAATGCAG TTTAAGGTTT AC AC CTATAA 
1561 AAG AG AG AG C CGTTATCGTC TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG 
1621 GCGACGGATG GTGATCCCCC TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA 
16 81 ACTTTACCCG GTGGTGCATA TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC 
1741 CAGTGTGCCG GTCTCCGTTA TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA 
.18 01 CATCAAAAAC GCCATTAACC TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA 
'l861 CAGCCAGTCT GCAGGTCGAC CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG 
1921 TCTGTTTTTT ATGCAAAATC TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC 
1981 GTTCAGCTTT CTTGTACAAA GTGGTGATAG CTTGTCGAGA AGTAC TAG AG GATCATAATC 
2 041 s AGCCATACCA CATTTGTAGA GGTTTTACTT GCTTTAAAAA ACCTCCCACA CCTCCCCCTG 
2101 AACCTGAAAC ATAAAATGAA TGCAATTGTT GTTGTTAACT TGTTTATTGC AG CTTAT AAT 
2161 GGTTACAAAT AAAG C AAT AG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 
2221 TCTAGTTGTG GTTTGTCCAA ACTCATCAAT GTATCTTATC ATGTCTGGAT CTGATCACTG 
22 81 CTTGAGCCTA GGAGATCCGA AC C AG AT AAG TGAAATCTAG TTCCAAACTA TTTTGT C ATT 
2341 TTTAATTTTC GTATTAGCTT ACGACGCTAC ACCCAGTTCC CATCTATTTT GTCACTCTTC 
24 01 C CTAAAT AAT CCTTAAAAAC TCCATTTCCA CCCCTCCCAG TTCCCAACTA TTTTGTCCGC 
24 61 CCACAGCGGG GCATTTTTCT TCCTGTTATG TTTTTAATCA AACATCCTGC CAACTCCATG 
2 521 TGACAAACCG TCATCTTCGG CTACTTTTTC TCTGTCACAG AATGAAAATT TTTCTGTC AT 




2581 CTCTTCGTTA TTAATGTTTG TAATTGACTG AATATCAACG CTTATTTGCA GCCTGAATGG 
2 641 CGAATGGACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT TACGCGCAGC 
27 01 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT 
27 61 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC TTTAGGGTTC 
2 821 CGATTTAGTG CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA TGGTTCACGT 
2881 AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC CACGTTCTTT 

2 941 AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT CTATTCTTTT 

3 001 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT GATTTAACAA 
3 061 AAATTTAACG CGAATTTTAA CAAAATATTA ACGTTTACAA TTTCAGGTGG CACTTTTCGG 
3121 GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA TAC ATT C AAA TATGTATCCG 
3181 CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT 
3241 ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT 
3 3 01 GCTCACCCAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG 
3361 GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA 
3421 CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT 
34 81 GACGCCGGGC AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG 
3 541 TACTCACCAG T C AC AG AAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT 
36 01 GCTGCCATAA C C ATGAGTG A TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA 
3 661 CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT 
3721 TGGGAACCGG AG CTGAATGA AGCCATACCA AACGACGAGC GTGAC AC C AC GATGCCTGTA 
3 781 GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG 
3 841 CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GAC C ACTTCT GCGCTCGGCC 
3 9 01 CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT 

3 961 ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG 

4 021 GGGAGTCAGG CAACTATGGA TGAACGAAAT AG ACAGATC G CTGAGATAGG TGCCTCACTG 
4 081 ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA 
4141 CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT C ATG AC C AAA 
42 01 ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA 
4261 TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG 
4 321 CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT 
4 381 GGCTTCAGCA GAG CGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC 
4441 CACTTCAAGA ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG 
4501 GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG 
4 561 GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA 
4621 ACGACCTACA CCGAACTGAG ATAC CTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC 
4681 GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG 
4 741 AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC 
4801 TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC 
4861 AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT 
4 921 CCTGCGTTAT CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTGATACC 

4 981 GCTCGCCGCA GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC 

5 041 CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT CACACCGCAG ACCAGCCGCG 
5101 TAACCTGGCA AAATCGGTTA CGGTTGAGTA ATAAATGGAT GCCCTGCGTA AGCGGGTGTG 
5161 GGCGGACAAT AAAGTCTTAA ACTGAACAAA ATAGATCTAA ACTATGACAA TAAAGTCTTA 

,5221 AACTAGACAG AATAGTTGTA AACTGAAATC AGTCCAGTTA TGCTGTGAAA AAGCATACTG 
52 81 GAC TTTTGTT ATGGCTAAAG CAAACTCTTC ATTTTCTGAA GTGCAAATTG CCCGTCGTAT 
5 341 TAAAGAGGGG CGTGGCCAAG GGCATGGTAA AGACTATATT CGCGGCGTTG TGACAATTTA 
54 01 CCGAACAACT CCGCGGCCGG GAAGCCGATC TCGGCTTGAA CGAATTGTTA GGTGGCGGTA 
5 4 61". CTTGGGTCGA TATCAAAGTG CATCACTTCT TCCCGTATGC CCAACTTTGT AT AG AG AG C C 
5521 ACTGCGGGAT CGTCACCGTA ATCTGCTTGC ACGTAGATCA CATAAGCACC AAGCGCGTTG 
5581 GCCTCATGCT TGAGGAGATT GATGAGCGCG GTGGCAATGC CCTGCCTCCG GTGCTCGCCG 
5 641 GAGACTGCGA GATCATAGAT ATAGATCTCA CTACGCGGCT GCTCAAACCT GGGCAGAACG 
57 01 TAAGCCGCGA GAGCGCCAAC AACCGCTTCT TGGTCGAAGG CAGCAAGCGC GATGAATGTC 
57 61 TTACTACGGA GCAAGTTCCC GAGGTAATCG GAGTCCGGCT GATGTTGGGA GTAGGTGGCT 
5821 ACGTCTCCGA ACTCACGACC GAAAAGATCA AGAGCAGCCC GCATGGATTT GACTTGGTCA 
5881 GGGCCGAGCC TACATGTGCG AATGATGCCC ATACTTGAGC CACCTAACTT TGTTTTAGGG 

5 941 CGACTGCCCT GCTGCGTAAC ATCGTTGCTG CTGCGTAACA TCGTTGCTGC TCCATAACAT 

6 001 CAAACATCGA CCCACGGCGT AACGCGCTTG CTGCTTGGAT GCCCGAGGCA TAGACTGTAC - 



6 061 AAAAAAACAG TCATAACAAG CCATGAAAAC 

6121 GGTCAAGGTT CTGGACCAGT TGCGTGAGCG 

6181 CGAACAGGCT TATGTCAACT GGGTTCGTGC 

6241 GGCAACCTTG GGCAGCAGCG AAGTCGAGGC 

6 3 01 GGTTTCGGTC TCCACGCATC GTCAGGCATT 

6 3 61 GCTGTGCACG GATCTGCCCT GGCTTCAGGA 

6421 GCCGGTGGTG CTGACCCCGG ATGAAGTGGT 

64 81 TCGTTTGTTC GCCCAGGACT C TAG C TAT AG 



CGCCACTGCG CCGTTACCAC CGCTGCGTTC 
CATACGCTAC TTGCATTACA GTTTACGAAC 
CTTCATCCGT TTCCACGGTG TGCGTCACCC 
ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA 
GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT 
GATCGGAAGA CCTCGGCCGT CGCGGCGCTT 
TCGCATCCTC GGTTTTCTGG AAGGCGAGCA 
TTCTAGTGGT TGGCTA 



Figure ^A; p ]>^>"r? Semliki Forest Virus vector 



103 ttg gcg agg gac att aag gcg ttt aag aaa ttg aga gga cct gtt ata 
aac cgc tec ctg taa ttc cgc aaa ttc ttt aac tct cct gga caa tat 




154 —fete tac " Jgc ggt cct agt ttg gfcg cgt taa tac aca gaa ttc tga ttg Lgat 
gag atg ccg cca gga tct aac epe gca att atg tgt ctt aag act aac eta 
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ccc gjjt^bcg aag cgc get ttc cca tea I aca age c 3<^ga£ ^ajt^ ^a 
l^rg cca ^gc ttc gcg cga aag ggt agt jtgt tea aac atg ttt|r 




pDEST9 12464 bp 



Location (Base Nos . ) 
355 . .232 
605 . . 1264 
1384 . . 1468 
1606 . . 1911 
1952 . .2078 
2532 . .2782 
3482 . .4282 
5232 . .5365 
5365. .6965 
6965 . . 9265 
9265 . . 10865 
10865 . . 161 



Gene Encoded 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

ori 

ampR 

SP6 promoter 

nsPl : non-structural protein 1 
nsP2 ; non- structural protein 2 
nsP3 : non-structural protein 3 
nsP4 : non- structural protein 4 



1 AGCAAGTGGT TCCGGACAGG CTTGGGGGCC 
61 GAGGTAGAGG GCTGCAAAAG TATCCTCATA 
121 GCGTTTAAGA AATTGAGAGG ACCTGTTATA 
181 TAATACACAG AATTCTGATT GGATCCCGGT 
241 ACAAAAAAGC TGAACGAGAA ACGTAAAATG 
3 01 TGCATAAAAA ACAGACTACA TAATACTGTA 
361 CGCTAAGTTG GCAGCA%CAC CCGACGCACT 
421 TCACTTCGCA GAATAAATAA ATCCTGGTGT 
481 CTTTTGGCGA AAATGAGACG TTGATCGGCA 
541 AATAAGATCA CTACCGGGCG T ATTTTTT G A 
6 01 TAAAATGGAG AAAAAAATCA CTGGATATAC 
661 AGAACATTTT GAGGCATTTC AGTCAGTTGC 
721 GGATATTACG GCCTTTTTAA AGAC CGTAAA 
781 TATTCACATT CTTGCCCGCC TGATGAATGC 
841 CGGTGAGCTG GTGATATGGG ATAGTGTTCA 
9 01 TGAAACGTTT TCATCGCTCT GGAGTGAATA 
961 ATATTCGCAA GATGTGGCGT GTTACGGTGA 
1021 TGAGAATATG TTTTTCGTCT CAGCCAATCC 
1081 CGTGGCCAAT ATGGACAACT TCTTCGCCCC 
1141 AGGCGACAAG GTGCTGATGC CGCTGGCGAT 
12 01 CCATGTCGGC AGAATGCTTA ATGAATTACA 

12 61 GTAAAGATCT GGATCCGGCT TACTAAAAGC 

13 21 TTTTTGCGGT ATAAGAATAT ATACTGATAT 
13 81 GCTATGAAGC AGCGTATTAC AGTGACAGTT 
1441 TATATGATGT CAATATCTCC GGTCTGGTAA 
1501 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC 
15 61 TTGAAATGAA CGGCTCTTTT GCTGACGAGA 
1621 TACACCTATA AAAGAGAGAG CCGTTATCGT 

,16 81 GACACGCCCG GGCGACGGAT GGTGATCCCC 
1741 GTCTCCCGTG AACTTTACCC GGTGGTGCAT 
18 01 ACCGATATGG CCAGTGTGCC GGTCTCCGTT 
1861 CGCGAAAATG ACATCAAAAA CGCCATTAAC 
192 1 "-TCCCTTATAC AC AG C C AGTC TGCAGGTCGA 
1981 GTATTATGTA GTCTGTTTTT TATGCAAAAG 
2 041 TTTTACGTTT CTCGTTCAGC TTTCTTGTAC 
2101 TCGATCCCGC GGCCGCTTTC GAACCTAGGC 
2161 AATTAC ATC C CTACGCAAAC GTTTTACGGC 
2221 CCTTGGCCGT TGCAGGCCAC TCCGGTGGCT 
2281 ATGCAGCAAC TCATCAGCGC CGTAAATGCG 
2 341 GCTAGGAGCT TAATTCGACG AATAATTGGA 
2 4 01 TATTTC CAAA AAAAAAAAAA AAAAAAAAAA 



GAACTGGAGG TGGCACTAAC ATGTAGGTAT 
GCCATGGCCA CCTTGGCGAG GGACATTAAG 
CACCTCTACG GCGGTCCTAG ATTGGTGCGT 
CCGAAGCGCG CTTTCCCATC ACAAGTTTGT 
ATATAAATAT CAATATATTA AATTAGATTT 
AAACACAACA TATCCAGTCA CTATGGCGGC 
TTGCGCCGAA TAAATAC CTG TGACGGAAGA 
CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA 
CGTAAGAGGT TCCAACTTTC AC C ATAATGA 
GTTAT C GAGA TTTTCAGGAG CTAAGGAAGC 
CACCGTTGAT ATATCCCAAT GGCATCGTAA 
TCAATGTACC TATAACCAGA CCGTTCAGCT 
GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
TCATCCGGAA TTCCGTATGG CAATGAAAGA 
CCCTTGTTAC ACCGTTTTCC ATGAG C AAAC 
CCACGACGAT TTCCGGCAGT TTCTACACAT 
AAACCTGGCC TATTTCCCTA AAGGGTTTAT 
CTGGGTGAGT TTCACCAGTT TTGATTTAAA 
CGTTTTCACC ATGGGCAAAT ATTATAC G C A 
TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
ACAGTACTGC GATGAGTGGC AGGGCGGGGC 
CAGATAACAG TATG CGTATT TGCGCGCTGA 
GTATACCCGA AGTATGTCAA AAAGAGGTGT 
G AC AG C G AC A GCTATCAGTT GCTCAAGGCA 
G C AC AAC CAT GCAGAATGAA GCCCGTCGTC 
AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
ACAGGGACTG GTGAAATGCA GTTTAAGGTT 
CTGTTTGTGG ATGTACAGAG TGATATTATT 
CTGGCCAGTG CACGTCTGCT GTCAGATAAA 
ATCGGGGATG AAAGCTGGCG CATGATG AC C 
ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
CTGATGTTCT GGGGAATATA AATGTCAGGC 
C C ATAGTG AC TGGATATGTT GTGTTTTACA 
TGCTAATTTA ATATATTGAT ATTT AT AT C A 
AAAGTGGTGA TGGGAACTCG AGTTCACTAG 
AAGCATGCGG GCCCAGTGGG TAATTAATTG 
CGCCGGTGGC GCCCGCGCCC GGCGGCCCGT 
CCCGTCGTCC CCGACTTCCA GGCCCAGCAG 
CTGACAATGA GACAGAACGC AATTGCTCCT 
TTTTTATTTT ATTTTGCAAT TGGTTTTTAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA- 



24 61 AAAAAAAAAA AAAAAAACTA GAAATCGCGA TTTCTAGTCT GCATTAATGA ATCGGCCAAC 
2 521 GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC 
2 581 TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATAC GGT 
2 641 TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 
27 01 CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

27 61 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGG C GAAA CCCGACAGGA CTATAAAGAT 
2 821 ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 

28 81 CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCGCGCT 

2 941 GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC 

3 001 CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACC CGGTAA 
3 061 GACACGACTT ATCGCCACTG G C AG C AG C C A CTGGTAACAG GATTAGCAGA GCGAGGTATG 
3121 TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 
3181 TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
3241 GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA 
3 3 01 CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 
3361 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA 
3 421 CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA 
34 81 CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 
3 541 TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT 
36 01 TAC CATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 
3661 TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 
3721 CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 
3781 ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
3 841 GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT 
3 9 01 TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 

3 961 CAGTGTTATC ACTCATGGTT ATGG C AG C AC TGCATAATTC TCTTACTGTC ATGCCATCCG 

4 021 TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 
4 081 GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 
4141 CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 
42 01 CGCTGTTGAG ATCCAGTTCG ATGTAAC C C A CTCGTGCACC CAACTGATCT TCAGCATCTT 
42 61 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 
4 321 GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 
4381 GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
4441 AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA 
4 501 TTATTATCAT G AC ATTAAC C TATAAAAATA GGCGTATCAC GAGGCCCTTT CGTCTCGCGC 
4 561 GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACATGCAGCT CCCGGAGACG GTC AC AG CTT 
4 621 CTGTCTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG CGCGTCAGCG GGTGTTGGCG 
46 81 GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGC AC CAT A 
4 741 TCGACGCTCT CCCTTATGCG ACTCCTGCAT TAGGAAGCAG CCCAGTACTA GGTTGAGGCC 
4 8 01 GTTG AG C AC C GCCGCCGCAA GGAATGGTGC ATGCAAGGAG ATGGCGCCCA ACAGTCCCCC 
4 861 GGCCACGGGG CCTGCCACCA TACCCACGCC GAAACAAGCG CTCATGAGCC CGAAGTGGCG 
4 921 AGCCCGATCT TCCCCATCGG TGATGTCGGC GATATAGGCG CCAGCAACCG CACCTGTGGC 

4 981 GCCGGTGATG CCGGCCACGA TGCGTCCGGC GT AG AG GAT C TGGCTAGCGA TGACCCTGCT 

5 041 GATTGGTTCG CTGACCATTT CCGGGGTGCG GAACGGCGTT ACCAGAAACT CAGAAGGTTC 
5101 GTCCAACCAA ACCGACTCTG ACGG CAGTTT ACGAGAGAGA TGATAGGGTC TGCTTCAGTA 

7 51 61 AGCCAGATGC TACACAATTA GGCTTGTACA TATTGTCGTT AGAACGCGGC TACAATTAAT 
5221 AC ATAAC CTT ATGTATCATA CACATACGAT TTAGGTGACA CTATAGATGG CGGATGTGTG 
52 81 ACATACACGA C G C C AAAAG A TTTTGTTCCA GCTCCTGCCA CCTCCGCTAC G C GAG AG ATT 
5341vAACCACCCAC GATGGCCGCC AAAGTGCATG TTGATATTGA GGCTGACAGC CC ATT CATC A 
54 01 AGTCTTTGCA GAAGGCATTT CCGTCGTTCG AGGTGGAGTC ATTGCAGGTC AC AC C AAATG 
54 61 ACCATGCAAA TG C C AG AG C A TTTTCGCACC TGGCTACCAA ATTGATCGAG CAGGAGACTG 
5 521 ACAAAGACAC ACTCATCTTG GAT AT C GG C A GTGCGCCTTC CAGGAGAATG ATGTCTACGC 
5 581 ACAAATACCA CTGCGTATGC CCTATGCGCA GCGCAGAAGA CCCCGAAAGG CTCGATAGCT 
5 641 ACGCAAAGAA ACTGGCAGCG GCCTCCGGGA AGGTGCTGGA TAGAGAGATC GCAGGAAAAA 

57 01 TCACCGACCT GCAGACCGTC ATGGCTACGC CAGACGCTGA ATCTCCTACC TTTTGCCTGC 
5761 ATACAGACGT CACGTGTCGT ACGGCAGCCG AAGTGGCCGT ATACCAGGAC GTGTATGCTG 
5821 TACATGCACC AACATCGCTG TACCATCAGG CGATGAAAGG TGTCAGAACG GCGTATTGGA 

58 81 TTGGGTTTGA CACCACCCCG TTTATGTTTG ACGCGCTAGC AGGCGCGTAT CCAACCTACG- 




5 941 CCACAAACTG GGCCGACGAG CAGGTGTTAC AGGCCAGGAA CATAGGACTG TGTGCAGCAT 

6 001 CCTTGACTGA GGGAAGACTC GGCAAACTGT CCATTCTCCG CAAGAAGCAA TTGAAACCTT 
6 061 GCGACACAGT CATGTTCTCG GTAGGATCTA CATTGTACAC TG AG AG C AG A AAGCTACTGA 
6121 GGAGCTGGCA CTTACCCTCC GTATTCCACC TGAAAGGTAA ACAATCCTTT ACCTGTAGGT 
6181 GCGATACCAT CGTATCATGT GAAGGGTACG TAGTTAAGAA AATCACTATG TGCCCCGGCC 
6241 TGTACGGTAA AACGGTAGGG TACGCCGTGA CGTATCACGC GGAGGGATTC CTAGTGTGCA 

63 01 AGACCACAGA CACTGTCAAA GGAGAAAGAG TCTCATTCCC TGTATGCACC TACGTCCCCT 
6361 CAACCATCTG TGATCAAATG ACTGGCATAC TAG CGAC C GA CGTCACACCG GAGGACGCAC 
6421 AGAAGTTGTT AGTGGGATTG AATCAGAGGA TAGTTGTGAA CGGAAGAACA CAGCGAAACA 

64 81 CTAACACGAT GAAGAACTAT CTGCTTCCGA TTGTGGCCGT CGCATTTAGC AAGTGGGCGA 
6541 GGGAATACAA GGCAGACCTT GATGATGAAA AACCTCTGGG TGTCCGAGAG AGGTCACTTA 
6 6 01 CTTGCTGCTG CTTGTGGGCA TTTAAAAC G A GGAAGATGCA C AC CATGTAC AAGAAACCAG 
6661 ACACCCAGAC AATAGTGAAG GTGCCTTCAG AGTTTAACTC GTTCGTCATC CCGAGCCTAT 
6721 GGTCTACAGG CCTCGCAATC CCAGTCAGAT CACGCATTAA GATGCTTTTG GCCAAGAAGA 
6781 CCAAGCGAGA GTTAATACCT GTTCTCGACG CGTCGTCAGC CAGGGATGCT GAACAAGAGG 
6841 AGAAGGAGAG GTTGGAGGCC GAGCTGACTA GAGAAGCCTT ACCACCCCTC GTCCCCATCG 
6 901 CGCCGGCGGA GACGGGAGTC GTCGACGTCG ACGTTGAAGA ACTAGAGTAT CACGCAGGTG 

6 961 CAGGGGTCGT GGAAAC AC CT CGCAGCGCGT TGAAAGTCAC CGCACAGCCG AACGACGTAC 

7 021 TACTAGGAAA TTACGTAGTT CTGTCCCCGC AGACCGTGCT CAAGAGCTCC AAGTTGGCCC 
7081 CCGTGCACCC TCTAGCAGAG CAGGTGAAAA TAATAACACA TAACGGGAGG GCCGGCGGTT 
7141 ACCAGGTCGA CGGATATGAC GGCAGGGTCC TACTACCATG TGGATCGGCC ATTCCGGTCC 
72 01 CTGAGTTTCA GGCTTTGAGC GAGAGCGCCA CTATGGTGTA CAACGAAAGG GAGTTCGTCA 

72 61 ACAGGAAACT ATACCATATT GCCGTTCACG GACCCTCGCT GAACACCGAC GAGGAGAACT 
7321 ACGAGAAAGT C AG AG C TG AA AGAACTGACG CCGAGTACGT GTTCGACGTA GATAAAAAAT 

73 81 GCTGCGTCAA GAGAGAGGAA GCGTCGGGTT TGGTGTTGGT GGGAGAGCTA ACCAACCCCC 

74 41 CGTTCCATGA ATTCGCCTAC GAAGGG C TG A AGATCAGGCC GTCGGCACCA TATAAGACTA 
7501 CAGTAGTAGG AGTC TTTGGG GTTCCGGGAT CAGGCAAGTC TG CTATT ATT AAGAGCCTCG 
7561 TGACCAAACA CGATCTGGTC ACCAGCGGCA AGAAGGAGAA CTGCCAGGAA ATAGTTAACG 
7621 ACGTGAAGAA GCACCGCGGG AAGGGGACAA GTAGGGAAAA CAGTGACTCC ATCCTGCTAA 
76 81 ACGGGTGTCG TCGTGCCGTG GACATCCTAT ATGTGGACGA GGCTTTCGCT TGCCATTCCG 
7741 GTACTCTGCT GGCCCTAATT GCTCTTGTTA AACCTCGGAG CAAAGTGGTG TTATGCGGAG 
78 01 ACCCCAAGCA ATGCGGATTC TTCAATATGA TGCAGCTTAA GGTGAACTTC AACCACAACA 
7861 TCTGCACTGA AGTATGTCAT AAAAGTATAT CCAGACGTTG CACGCGTCCA GTCACGGCCA 
7 921 TCGTGTCTAC GTTGCACTAC GGAGGCAAGA TGCGCACGAC CAACCCGTGC AACAAACCCA 

7 981 TAATCATAGA C AC C AC AGGA CAGACCAAGC CCAAGCCAGG AGACATCGTG TTAACATGCT 

8 041 TCCGAGGCTG GGCAAAGCAG CTGCAGTTGG ACTACCGTGG ACACGAAGTC ATG AC AG C AG 
8101 CAGCATCTCA GGGCCTCACC CGCAAAGGGG TATACGCCGT AAGGCAGAAG GTGAATGAAA 
8161 ATCCCTTGTA TGCCCCTGCG TCGGAGCACG TGAATGTACT GCTGACGCGC ACTGAGGATA 
82 21 GGCTGGTGTG GAAAACGCTG GCCGGCGATC CCTGGATTAA GGTCCTATCA AACATTCCAC 
82 81 AGGGTAACTT TACGGCCACA TTGGAAGAAT GGCAAGAAGA ACACGACAAA ATAATGAAGG 
8 341 TGATTGAAGG ACCGGCTGCG CCTGTGGACG CGTTCCAGAA CAAAGCGAAC GTGTGTTGGG 
84 01 CGAAAAGCCT GGTGCCTGTC CTGGACACTG CCGGAATCAG ATTG AC AG C A GAGGAGTGGA 
84 61 G C AC C ATAAT TACAGCATTT AAGGAGGACA GAGCTTACTC TCCAGTGGTG GCCTTGAATG 
8 521 AAATTTGCAC CAAGTACTAT GGAGTTG AC C TGGACAGTGG CCTGTTTTCT GCCCCGAAGG 
8 581 TGTCCCTGTA TTACGAGAAC AACCACTGGG AT AAC AG AC C TGGTGGAAGG ATGTATGGAT 

V641 TCAATGCCGC AACAGCTGCC AGGCTGGAAG CTAGACATAC CTTCCTGAAG GGGCAGTGGC 
87 01 ATACGGGCAA GCAGGCAGTT ATCGCAGAAA GAAAAATC C A ACCGCTTTCT GTGCTGGACA 
8761 ATGTAATTCC TATCAACCGC AGGCTGCCGC ACGCCCTGGT GGCTGAGTAC AAGACGGTTA 
8821< AAGGCAGTAG GGTTGAGTGG CTGGTCAATA AAGTAAGAGG GTACCACGTC CTGCTGGTGA 
8881 GTGAGTACAA CCTGGCTTTG CCTCGACGCA GGGTCACTTG GTTGTCACCG CTGAATGTCA 

8 941 CAGGCGCCGA TAGGTGCTAC GACCTAAGTT TAGGACTGCC GGCTGACGCC GGCAGGTTCG 

9 001 ACTTGGTCTT TGTGAACATT CACACGGAAT TCAGAATCCA CCACTACCAG CAGTGTGTCG 
9061 ACCACGCCAT GAAGCTGCAG ATGCTTGGGG GAGATGCGCT ACGACTGCTA AAACCCGGCG 
9121 GCATCTTGAT GAGAGCTTAC GGATACGCCG ATAAAATCAG CGAAGCCGTT GTTTCCTCCT 
9181 TAAG C AG AAA GTTCTCGTCT GCAAGAGTGT TGCGCCCGGA TTGTGTCACC AG C AAT AC AG 
9241 AAGTGTTCTT GCTGTTCTCC AACTTTGACA ACGGAAAGAG ACCCTCTACG CTAC AC C AG A 
93 01 TGAATACCAA GCTGAGTGCC GTGTATGCCG G AG AAG C CAT GCACACGGCC GGGTGTGCAC 
93 61 CATC CTACAG AGTTAAGAGA GCAGACATAG CCACGTGCAC AGAAGCGGCT GTGGTTAACG - 



94 21 CAGCTAACGC CCGTGGAACT GTAGGGGATG GCGTATGCAG GGCCGTGGCG AAGAAATGGC 
94 81 CGTCAGCCTT TAAGGGAGCA GCAACACCAG TGGGCACAAT TAAAACAGTC ATGTGCGGCT 
9541 CGTACCCCGT CATCCACGCT GTAGCGCCTA ATTTCTCTGC CACGACTGAA GCGGAAGGGG 

9 6 01 ACCGCGAATT GGCCGCTGTC TACCGGGCAG TGGCCGCCGA AGTAAACAGA CTGTCACTGA 
96 61 GCAGCGTAGC CATCCCGCTG CTGTCCACAG GAGTGTTCAG CGGCGGAAGA GATAGGCTGC 
9721 AGCAATCCCT CAACCATCTA TTCACAGCAA TGGACGCCAC GGACGCTGAC GTGACCATCT 
9781 AC TG C AG AG A CAAAAGTTGG GAGAAGAAAA TCCAGGAAGC CATTGACATG AGGACGGCTG 
9841 TGGAGTTGCT CAATGATGAC GTGGAGCTGA CCACAGACTT GGTGAGAGTG CACCCGGACA 
9 9 01 GCAGCCTGGT GGGTCGTAAG GGCTACAGTA CCACTGACGG GTCGCTGTAC TCGTACTTTG 
9 961 AAGGTACGAA ATTCAACCAG GCTGCTATTG ATATGGCAGA GATACTGACG TTGTGGCCCA 

10021 GACTGCAAGA GGCAAACGAA CAGATATGCC TATACGCGCT GGGCGAAACA ATGGACAACA 
10 081 TCAGATCCAA ATGTCCGGTG AACGATTCCG ATTCATCAAC ACCTCCCAGG ACAGTGCCCT 
10141 GCCTGTGCCG CTACGCAATG ACAGCAGAAC GGATCGCCCG C CTTAGGTC A C AC C AAGTT A 
10201 AAAGCATGGT GGTTTGCTCA TCTTTTCCCC TCCCGAAATA CCATGTAGAT GGGGTGCAGA 
10261 AGGTAAAGTG C G AG AAGGTT CTCCTGTTCG ACCCGACGGT ACCTTCAGTG GTTAGTC CGC 
10321 GGAAGTATGC CGCATCTACG ACGGAC C ACT CAGATCGGTC GTTACGAGGG TTTGACTTGG 
10381 ACTGGACCAC CGACTCGTCT TCCACTGCCA G CGATAC CAT GTCGCTACCC AGTTTG C AGT 
10441 CGTGTGACAT CGACTCGATC TACGAGCCAA TGGCTCCCAT AGTAGTGACG GCTGACGTAC 
105 01 ACCCTGAACC CGCAGGCATC GCGGACCTGG CGGCAGATGT GCACCCTGAA CCCGCAGACC 
10561 ATGTGGACCT GGAGAACCCG ATTCCTCCAC CGCGCCCGAA GAGAGCTGCA TACCTTGCCT 
10621 CCCGCGCGGC GGAGCGACCG GTGCCGGCGC CGAGAAAGCC GACGCCTGCC CCAAGGACTG 
10681 CGTTTAGGAA CAAGCTGCCT TTGACGTTCG GCGACTTTGA CGAGCACGAG GTCGATGCGT 
10741 TGGCCTCCGG GATTACTTTC GGAGACTTCG ACGACGTCCT GCGACTAGGC CGCGCGGGTG 
10801 CATATATTTT CTCCTCGGAC ACTGGCAGCG GACATTTACA ACAAAAATCC GTTAGGCAGC 
10861 ACAATCTCCA GTGCGCACAA CTGGATGCGG TCCAGGAGGA GAAAATGTAC CCGCCAAAAT 

10 921 TGGATACTGA GAGGGAGAAG CTGTTGCTGC TG AAAATG C A GATGCACCCA TCGGAGG CTA 
10981 ATAAGAGTCG ATACCAGTCT CGCAAAGTGG AGAACATGAA AGCCACGGTG GTGGACAGGC 
11041 TCACATCGGG GGCCAGATTG TACACGGGAG CGGACGTAGG CCGCATACCA ACATACGCGG 
11101 TTCGGTACCC CCGCCCCGTG TACTCCCCTA CCGTGATCGA AAGATTCTCA AGCCCCGATG 
11161 TAG C AAT CGC AGCGTGCAAC GAATAC CTAT C C AG AAATTA CCCAACAGTG GCGTCGTACC 
112 21 AGATAACAGA TG AATAC G AC GCATACTTGG ACATGGTTGA CGGGTCGGAT AGTTGCTTGG 
112 81 AC AG AG C G AC ATTCTGCCCG GCGAAGCTCC GGTGCTACCC G AAAC AT CAT GCGTACCACC 
11341 AGCCGACTGT ACGCAGTGCC GTCCCGTCAC CCTTTCAGAA CACACTACAG AACGTGCTAG 
114 01 CGGCTGCCAC CAAGAGAAAC TGCAACGTCA CGCAAATGCG AGAACTACCC ACCATGGACT 
114 61 CGGCAGTGTT CAACGTGGAG TGCTTCAAGC GCTATGCCTG CTCCGGAGAA TATTGGGAAG 
11521 AATATGCTAA ACAACCTATC CGGATAACCA CTGAGAACAT CACTACCTAT GTGACCAAAT 
11581 TGAAAGGCCC GAAAGCTGCT GCCTTGTTCG CTAAGAC C C A CAACTTGGTT CCGCTGCAGG 
11641 AGGTTCCCAT GGACAGATTC ACGGTCGACA TGAAACGAGA TGTCAAAGTC ACTCCAGGGA 
117 01 C G AAAC AC AC AGAGGAAAGA CCCAAAGTCC AGGTAATTCA AGCAGCGGAG CCATTGGCGA 

117 61 CCGCTTACCT GTGCGGCATC CACAGGGAAT TAGTAAGGAG ACTAAATGCT GTGTTACGCC 
11821 CTAACGTGCA CACATTGTTT GATATGTCGG CCGAAGACTT TGACGCGATC ATCGCCTCTC 

118 81 ACTTCCACCC AGGAGACCCG GTTCTAGAGA CGGACATTGC ATCATTCGAC AAAAGCCAGG 
11941 ACGACTCCTT GGCTCTTACA GGTTTAATGA TCCTCGAAGA TCTAGGGGTG GATC AGTAC C 
12 001 TGCTGGACTT GATCGAGGCA GCCTTTGGGG AAATATC C AG CTGTCACCTA CCAACTGGCA 
12 061 CGCGCTTCAA GTTCGGAGCT ATGATGAAAT CGGGCATGTT TCTGACTTTG TTTATTAACA 
12121 CTGTTTTGAA CATCACCATA GCAAGCAGGG TACTGGAGCA GAGACTCACT GACTCCGCCT 
12181 GTGCGGCCTT CATCGGCGAC G AC AAC AT C G TTCACGGAGT GATCTCCGAC AAGCTGATGG 
12241 CGGAGAGGTG CGCGTCGTGG GTCAACATGG AGGTGAAGAT CATTGACGCT GTCATGGGCG 
12 3 01^ AAAAACCCCC ATATTTTTGT GGGGGATTCA TAGTTTTTGA CAGCGTCACA CAGACCGCCT 
12361 GCCGTGTTTC AGACCCACTT AAGCGCCTGT TCAAGTTGGG TAAGCCGCTA ACAGCTGAAG 
124 21 ACAAGCAGGA CGAAGACAGG CGACGAGCAC TGAGTGACGA GGTT 
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pDESTIO 6708 bp 



Location (Base Nos . ) 
23 . . 152 
461 . .337 
711 . . 
1490 . 
1712 , 
2058 . 
3394 
4510 
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1370 
. 1574 
.2017 
. .2182 
. .4369 
. .5164 
. .62 



Gene Encoded 

Ppolh 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

ampR 

ori 

genR 



1 CCCCGGATGA AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC 
61 AGGACTCTAG CTATAGTTCT AGTGGTTGGC TAC GTATACT CCGGAATATT AATAGATCAT 
121 GGAGATAATT AAAATGATAA CCATCTCGCA AATAAATAAG TATTTTACTG TTTTCGTAAC 
181 AGTTTTGTAA TAAAAAAACC TATAAATATT CCGGATTATT CATACCGTCC CACCATCGGG 
241 CGCGGATCTC GGTCCGAAAC CATGTCGTAC TACCATCACC AT C AC CATC A CGATTACGAT 
3 01 ATCCCAACGA CCGAAAACCT GTATTTTCAG GGCATCACAA GTTTGTACAA AAAAGCTGAA 
361 CGAGAAACGT AAAATGATAT AAATATCAAT ATATTAAATT AG ATTTTG C A TAAAAAACAG 
421 ACTACATAAT ACTGTAAAAC AC AAC AT AT C CAGTCACTAT GGCGGCCGCT AAGTTGGCAG 
481 CATCACCCGA CGCACTTTGC GCCGAATAAA TACCTGTGAC GGAAGATCAC TTCGCAGAAT 
541 AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT TGGCGAAAAT 
6 01 GAGACGTTGA TCGGCACGTA AG AGGTTC C A ACTTTCACCA TAATGAAATA AG AT C AC TAC 
661 CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAGCTAAA ATGGAGAAAA 
721 AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA TCGTAAAGAA CATTTTGAGG 
781 CATTTCAGTC AGTTGCTCAA TGTAC C TATA ACCAGACCGT TCAGCTGGAT ATTACGGCCT 
841 TTTTAAAGAC C GT AAAG AAA AATAAG C AC A AGTTTTATCC GGCCTTTATT CACATTCTTG 
901 CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA 
961 TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA ACGTTTTCAT 
1021 CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT ACACATATAT TCGCAAGATG 
1081 TGGCGTGTTA CGGTGAAAAC CTGGCCTATT TCCCTAAAGG GTTTATTGAG AATATGTTTT 
1141 TCGTCTCAGC CAATCCCTGG GTGAGTTTCA C C AGTTTTG A TTTAAACGTG GCCAATATGG 

12 01 ACAACTTCTT CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 
1261 TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT GTCGGCAGAA 
1321 TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGCGTAA ACGCGTGGAT 

13 81 CCGGCTTACT AAAAGCCAGA TAACAGTATG CGTATTTGCG CGCTGATTTT TGCGGTATAA 
1441 GAATATATAC TGATATGTAT ACCCGAAGTA TGTCAAAAAG AGGTGTGCTA TGAAGCAGCG 
15 01 TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGG CATATA TGATGTCAAT 
1561 ATCTCCGGTC TGGTAAGCAC AACCATGCAG AATGAAGCCC GTCGTCTGCG TGCCGAACGC 
1621 TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA AATGAACGGC 
1681 TCTTTTGCTG AC GAG AAC AG GGACTGGTGA AATGCAGTTT AAGGTTTACA CCTATAAAAG 
1741 AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA CGCCCGGGCG 
18 01 ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA G AT AAAGT C T CCCGTGAACT 

,1861 TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG ATATGGCCAG 
/ 1921 TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG AAAATGACAT 
1981 CAAAAACGCC ATTAACCTGA TGTTCTGGGG AATATAAATG TCAGGCTCCC TTATACACAG 
2 041 CCAGTCTGCA GGTCGAC CAT AGTGACTGGA TATGTTGTGT TTTACAGTAT TATGTAGTCT 
2101- GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC GTTTCTCGTT 
2161 CAGCTTTCTT GTACAAAGTG GTGATGCCAT GGATC CGGAA TTCAAAGGCC TACGTCGACG 
22 21 AGCTCAACTA GTGCGGCCGC TTTCGAATCT AGAGCCTGCA GTCTCGAGGC ATGCGGTACC 
2 281 AAGCTTGTCG AGAAGTACTA GAGGATCATA ATCAGCCATA CCACATTTGT AGAGGTTTTA 
2 341 CTTGCTTTAA AAAACCTCCC ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT 
24 01 GTTGTTGTTA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 
24 61 AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 
2 521 AATGTATCTT ATCATGTCTG GATCTGATCA CTGCTTGAGC CTAGGAGATC CGAACCAGAT 
2 581 AAGTGAAATC TAGTTC C AAA CTATTTTGTC ATTTTTAATT TTCGTATTAG CTTACGACGC - 



2 641 TACACCCAGT TCCCATCTAT TTTGTCACTC TTCCCTAAAT AATCCTTAAA AACTCCATTT 
27 01 CCACCCCTCC CAGTTCCCAA CTATTTTGTC CGCCCACAGC GGGGCATTTT TCTTCCTGTT 
2761 ATGTTTTTAA TCAAACATCC TGCCAACTCC ATGTGACAAA CCGTCATCTT CGGCTACTTT 
2 821 TTCTCTGTCA CAGAATGAAA ATTTTTCTGT CATCTCTTCG TTATTAATGT TTGTAATTGA 
2881 CTGAATATCA ACGCTTATTT GCAGCCTGAA TGGCGAATGG GACGCGCCCT GTAGCGGCGC 

2 941 ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT 

3 001 AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG 
3 061 TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA 
3121 CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT 
3181 TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG 
3241 AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC 
3 3 01 GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT 
3361 ATTAACGTTT AC AATTT C AG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 
3421 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT 
34 81 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT 
3 541 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT 
3 6 01 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
3 661 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
3721 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 
3781 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT 
3 841 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC 
3 9 01 TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA 

3 961 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATG AAG C CAT 

4 021 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT 
4 081 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC 
4141 GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 
4 2 01 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTAT C ATT GCAGCACTGG GGCCAGATGG 
42 61 TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG 
4 321 AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTC AG AC C A 
4 381 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA 
4441 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA 
4 501 CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 
4 561 CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 
4 621 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGAT AC C AAA 
4681 TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 
4741 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG 
4 8 01 TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 
4 861 GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT 
4 921 ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC 
4 981 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
5041 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG 
5101 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 
5161 GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA 
5221 TAACCGTATT ACCGC CTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG 
52 81 CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA 
5341 TCTGTGCGGT ATTTCACACC GCAGAC C AG C CGCGTAACCT GGCAAAATCG GTTACGGTTG 
54 01 AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA 
54 61 CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG ACAGAATAGT TGTAAACTGA 
5521 -AATCAGTCCA GTTATGCTGT G AAAAAG CAT ACTGGACTTT TGTTATGGCT AAAGCAAACT 
5581 CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG 
5641 GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC AACTCCGCGG CCGGGAAGCC 

57 01 GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG TCGATATCAA AGTGCATCAC 
5761 TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC 

58 21 TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG 
58 81 CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT GCGAGATCAT AGATATAGAT 
5941 CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC 
6 001 TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA 
6 061 ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT CCGAACTCAC GACCGAAAAG- 



6121 ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG AGCCTACATG TGCGAATGAT 
6181 GCCCATACTT GAG C C AC C T A ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT 
624 1 GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG 

63 01 CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA C AAG C CAT G A 
6361 AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC CAGTTGCGTG 
6421 AG C G CAT AC G CTACTTG CAT TACAGTTTAC GAACCGAACA GGCTTATGTC AACTGGGTTC 

64 81 GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC AGCGAAGTCG 
6541 AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG CATCGTCAGG 
66 01 CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG CCCTGGCTTC 
6661 AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGA 
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1 CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA 

6i tcagSSag agaaaagtga aagtcgagtt taccactccc tatcagtgat agagaaaagt 

ill GAAAGTCGAG TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC 
III TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG 
III AAAaSgaAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG AGAAAAGTGA AAGTCGAGCT 
30i c^accSg" GTCGAGTAGG CGTGTACGGT GGGAGGCCTA TATAAGCAGA GCTCGTTTAG 
III tgaIccgtca GATCGCCTGG AGACGCCATC CACGCTGTTT TGACCTCCAT AGAAGACACC 
42i ggcSccgatc CAGCCTCCGC GGCCCCGAAT TCGAGCTCGG TACCCGGGGA tcctctagag 
481 TCGAGGTCGA CGGTATCGAT AAGCTTGATA TCAACAAGTT TGTACAAAAA AGCTGAACGA 
til GAAACGTAAA ATGATATAAA TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT 

6oi acISaaSS? SLaacaca ACATATCCAG TCACTATGGC ggccgctaag ttggcagcat 
661 SScSacgc actttgcgcc gaataaatac ctgtgacgga agatcacttc gcagaataaa 
72i SaatSSg tgtccctgtt gataccggga agccctgggc caacttttgg cgaaaatgag 

781 ACGTTGATCG GCACGTAAGA GGTTCCAACT TTCACCATAA TGAAATAAGA TCACTACCGG 
III GCGTATTTTT TGAGTTATCG AGATTTTCAG GAGCTAAGGA AG CTAAAATG GAGAAAAAAA 
90i tcaSggSa TACCACCGTT GATATATCCC AATGGCATCG taaagaacat TTTGAGGCAT 
961 SclScAGT TGCTCAATGT ACCTATAAC C AGACCGTTCA GCTGGATATT ACGGCCTTTT 
io" SaaSccgt AAAGAAAAAT AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC 
lltl GCCTGATGAA TGCTCATCCG GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT 
llll gggI?ag?ot TCACCCTTGT TACACCGTTT TCCATGAGCA AACTGAAACG ttttcatcgc 
lloi ?c?SgagtcI ataccacgac gatttccggc agtttctaca catatattcg caagatgtgg 
llel cgtcSaSS ?gaaaacctg gcctatttcc ctaaagggtt tattgagaat atgtttttcg 
lltl ^ctcIgJcaa tccctgggtg agtttcacca gttttgattt aaacgtggcc aatatggaca 

1381 IcSScC^ CCCCGTTTTC ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA 
llll TGCCGCTGGC GATTCAGGTT CATCATGCCG TCTGTGATGG CTTC C ATGTC GGCAGAATGC 
1501 Saatgaatt ACAACAGTAC TGCGATGAGT GGCAGGGCGG GGCGTAAAGA tctggatccg 
llll GCTTACTAAA AGCCAGATAA CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA 
lltl ?SI?ACT^A TATGTATACC CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT 
i68i taSgtcaca GTTGACAGCG ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC 

i74i SSgtctgg taagcacaac catgcagaat gaagcccgtc gtctgcgtgc cgaacgctgg 

1801 AAAGCGGAAA ATCAGGAAGG GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT 
i'861 TTTGCTGACG AGAACAGGGA CTGGTGAAAT G CAGTTT AAG GTTT AC AC CT ATAAAAGAGA 
nil GAGCCGTTAT CGTCTGTTTG TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG 
llll GATGGTGATC CCCCTGGCCA GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA 

20"* c?Sg?gotg CATATCGGGG ATGAAAGCTG gcgcatgatg accaccgata tggccagtgt 

2X01 GCCGGTCTCC GTTATCGGGG AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA 
llll AAACGCCATT AACCTGATGT TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA 
2221 GTCTGCAGGT CGACCATAGT GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT 
2281 SSatgcaa AATCTAATTT AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG 
2341 CTTTCTTGTA CAAAGTGGTT GATATCGAAT TCCTGCAGCC CGGGGGATCC ACTAGTTCTA 
llll GAGCACTGCG ATGAGTGGCA GGGCGGGGCG TAATTTTTTT AAGG C AGTTA TTGGTGCCCT 
llll TAAACGCCTG GTGCTACGCC TGAATAAGTG ATAATAAGCG GATGAATGGC AGAAATTCGC 
2521 JcSatcSg TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA- 



5941 CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA 
6001 GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG 
6 061 AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC 
6121 GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG 
6181 AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGTAGCAAT GGCAACAACG 
6241 TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC 
63 01 TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG 
6361 TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
6421 GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT 
6481 ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA 
6541 CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT 
66 01 AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG 
66 61 TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT 
6721 TTTTTTCTGC GCGTAATCTG CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT 
6781 TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG 
6841 C AG AT AC C AA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT 

6 901 GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC 
6961 GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG 

7 021 TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA 
7081 CTGAGATACC TACAGCGTGA GCATTGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG 
7141 GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG 
72 01 GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 
7261 TTTTTGTGAT GCTCGTCA 



Figure' "5iA ( - 



Native protein in E. coli: XPl 
promoter 



3721 
3781 
3841 
3901 
3961 



tgggcaaacc aagacagcta aa gatct ctc acctaccaaa caatgccccc ctgcaaaaaa 
acccgtttgg ttctgtcgat ttctagjagag tggatggttt gttacggggg gacgtttttt 



taaattcata taaaaaacat 
atttaagtat attttttgta 

ttgacataaa taccactggc 
actgtattt atggtgaccg 



I 



gaaggtgacg ctcttaaaaa 
cttccactgc gagaattttt 

tggggtgtgt gatacgaaac 
accccacaca ctatgctttg 



acagataacc atctgcggtg ataaattatc 
tgtctattgg tagacgccac tatttaatag 

ggtgatactb agcac&tcag caggacgcac 
ccactatgaj c tcgtgtagtc gtcctgcgtg 

ttaagecctg jaagaagggca gcattcaaag 
aattcgggac tjtcttcccgt cgtaagtttc 

* 

gaagcattgg gatcatcaca agtttjgtaca 
cttcgtaacc ctagtag tgt tcaaacatgt 



tctggcggtg 
agaccgccac 

tgaccaccat 
actggtggta 

cagaaggctt 
gtcttccgaa 



aaaaagcj tgH^ 
^tttcg;act 



Li. 



01 



• 5 3 




...Ahdl 



3000 

/ 



pDEST13 5848 bp 



Location (Base Nos. ) 
599 . . 1458 
4123 . .3998 
4372 . .5031 
5151 . . 5235 
5373. .5678 
5719 . .5843 



Gene Encoded 

ampR 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 



1 TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 
61 TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AG C G AAGAGG CCCGCACCGA 
121 TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCCTGATGC GGTATTTTCT 
181 CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATGGTGC ACTCTCAGTA CAATCTGCTC 
241 TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG 
301 GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT 
361 GTGTCAGAGG TTTTCACCGT C ATCAC CGAA ACGCGCGAGA CGAAAGGGCC TCGTGATACG 
421 CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAG AC GT C AG GTGGCACTTT 
481 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 
541 TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 
p 6 01 GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT 

ill 661 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG 

y s f 721 AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA 

781 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG 
841 TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
j» 9 01 TGAGTACTCA C C AGTC AC AG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 

gi 961 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG 

1021 AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GAT C ATGTAA CTCGCCTTGA 
1081 TCGTTGGGAA CCGGAGCTGA ATG AAG C CAT ACCAAACGAC GAGCGTGACA CCACGATGCC 
I- 1141 TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGG C GAAC TACTTA CTCTAGCTTC 

~ : : 12 01 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 

~f 1261 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG 

M 1321 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 

HJ 13 81 GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 

Q 1441 ACTGATTAAG CATTGGTAAC TGTC AG AC CA AGTTTACTCA TAT AT AC TTT AGATTGATTT 

O 15 01 AAAACTTCAT TTTTAATTTA AAAGGAT CTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 

15 61 CAAAATCCCT TAACGTGAGT TTTCGTTC CA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 
1621 AGGATCTTCT TGAGATC CTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 

16 81 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA T C AAG AG C T A CCAACTCTTT TTCCGAAGGT 
1741 AACTGGCTTC AGCAGAGCGC AGATAC C AAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG 
18 01 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
18 61 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
1921 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TG C AC AC AG C CCAGCTTGGA 

,1981 G C GAAC G AC C TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
2 041 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
2101 CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
2161 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA 
2221, -CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
22 81 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 
2 341 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAG C GAGG AAG C G G AAG A 
24 01 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
24 61 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 
2 521 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 
2 581 TGTGAGCGGA TAACAATTTC ACACAGGAAA C AG CTATGAC CATGATTACG CCAAGCTTGG 
2641 CTGCAGGTGA TGATTATCAG C C AG C AG AG A TTAAGGAAAA CAGACAGGTT TATTGAGCGC 
2 7 01 TTATCTTTCC CTTTATTTTT GCTGCGGTAA GTCGCATAAA AAC C ATT CTT C ATAATTCAA - 



^J2P 35B 





2761 


TCCATTTACT 


ATGTTATGTT 

X A X> X> Xi X X X» X» 




2821 


TGCTCAATTG 


TTATCAGCTA 




2881 


TTCAGGC CAC 


TGACTAGCGA 




2941 


CATTGGGTAC 


TGTGGGTTTA 




3001 


CTTGAAGGTA 


AACTCATCAC 




3061 


CTGCTCAGGG 


TCAACGAGAA 




3121 


TGCGGTCATG 


GAATTACCTT 




3181 


TGTGCTTACC 


CATCTCTCCG 




3241 


CCCTGCCTGA 


ACATGAGAAA 




3301 


ACTAACCGCT 


TCATACATCT 




3361 


GCTAACTTTG 

V_,J X_» JL X U X \>" -L. X. X. V_J 


AGAATTTTTG 

X XV_J iiiTi X. J. J*, x, X, V_^J 




3421 


ATTAAATAAA 


GCAC CAACGC 




3481 


GGATAAGCCA 


AGTTCATTTT 

X J* V_J X- Vv/ X A X. ^ X. _1_ 




3541 


AAGCTGCTCT 


TGTGTTAATG 




3601 


GGGATAAATA 


TCTAACACCG 




3661 


CATGTACTAA 


GGAGGTTGTA 




3721 


TGGGCAAACC 


AAGACAGCTA 




3781 


TAAATTCATA 


TAAAAAACAT 




3841 


TTGACATAAA 


TACCACTGGC 




3901 


GAAGGTGACG 


CTCTTAAAAA 




3961 


TGGGGTGTGT 


GATACGAAAC 




4021 


ACGAGAAACG 


TAAAATGATA 


si J 


4081 


GACTACATAA 


TACTGTAAAA 


Lll 


4141 


GCATCACCCG 


ACGCACTTTG 




4201 


TAAATAAATC 


CTGGTGTCCC 


tj. jj 


4261 


TGAGACGTTG 


ATCGGCACGT 


S3 


4321 


CCGGGCGTAT 


TTTTTGAGTT 


y 
01 


4381 


AAAATCACTG 


GATATACCAC 


4441 


GCATTTCAGT 


CAGTTGCTCA 

X A^J X> X» N_J N_>#- A. \_r X A 




4501 


TTTTTAAAGA 


CCGTAAAGAA 


s 


4561 


GCCCGCCTGA 


TGAATGCTCA 


i - s 


4621 


ATATGGGATA 

X * ^ X X X* ^— * ^— / X X Xr X 1 


GTGTTCACCC 




4681 


TCGCTCTGGA 


GTGAATACCA 

X. W* -J- nt*V V_*-*A 




4741 


GTGGCGTGTT 


ACGGTGAAAA 


Hi 


48 01 


TTCGTCTCAG 


CCAATCCCTG 




4861 


GACAACTTCT 


TCGCCCCCGT 




4921 


CTGATGCCGC 


TGGCGATTCA 




4981 


ATGCTTAATG 


AATTACAACA 




5041 


TCCGGCTTAC 


TAAAAGCCAG 




5101 


AGAATATATA 


CTGATATGTA 




5161 


GTATTACAGT 


GACAGTTGAC 




5221 


TATCTCCGGT 


CTGGTAAGCA 




5281 


CTGGAAAGCG 


GAAAATCAGG 




5341 


CTCTTTTGCT 


GACGAGAACA 




5401 


GAGAGAGC CG 


TTATCGTCTG 




54 61 


GACGGATGGT 


vJJxT. J. v_- v_. \_ v, _L VJJ 




5521 


TTTACCCGGT 


GGTGCATATC 




5581 


GTGTGCCGGT 


CTCCGTTATC 




5641 


vTCAAAAACGC 


CATTAACCTG 




5701 


GCCAGTCTGC 


AGGTCGACCA 




5761 


TGTTTTTTAT 


GCAAAATCTA 




5821 


TCAGCTTTCT 


TGTACAAAGT 



CTGAGGGGAG TGAAAATTCC CCTAATTCGA TGAAGATTCT 
TGCGCCGACC AGAACACCTT GCCGATCAGC CAAACGTCTC 
TAACTTTCCC CACAACGGAA CAACTCTCAT TGCATGGGAT 
GTGGTTGTAA AAACACCTGA CCGCTATCCC TGATCAGTTT 
CCCCAAGTCT GGCTATGCAG AAATCACCTG GCTCAACAGC 
TTAACATTCC GTCAGGAAAG CTTGGCTTGG AGCCTGTTGG 
CAACCTCAAG CCAGAATGCA GAATCACTGG CTTTTTTGGT 
CATCACCTTT GGTAAAGGTT CTAAGCTTAG GTGAGAACAT 
AAACAGGGTA CTCATACTCA CTTCTAAGTG ACGGCTGCAT 
CGTAGATTTC TCTGGCGATT GAAGGGCTAA ATTCTTCAAC 
CAAGCAATGC GGCGTTATAA GCATTTAATG CATTGATGCC 
CTGACTGCCC CATCCCCATC TTGTCTGCGA CAGATTCCTG 
TCTTTTTTTC ATAAATTGCT TTAAGGCGAC GTGCGTCCTC 
GTTTCTTTTT TGTGCTCATA CGTTAAATCT ATCACCGCAA 
TGCGTGTTGA CTATTTTACC TCTGGCGGTG ATAATGGTTG 
TGGAACAACG CATAACCCTG AAAGATTATG CAATGCGCTT 
AAGATCTCTC AC CTAC CAAA CAATGCCCCC CTGCAAAAAA 
ACAGATAACC ATCTGCGGTG ATAAATTATC TCTGGCGGTG 
GGTGATACTG AGCACATCAG CAGGACGCAC TG AC CAC CAT 
TTAAGCCCTG AAGAAGGGCA GCATTCAAAG CAGAAGGCTT 
GAAGCATTGG GATCATCACA AGTTTGTACA AAAAAGCTGA 
TAAATATCAA TATATTAAAT TAG ATTTTG C ATAAAAAACA 
CACAACATAT CCAGTCACTA TGGCGGCCGC TAAGTTGGCA 
CGCCGAATAA ATACCTGTGA CGGAAGATCA CTTCG CAGAA 
TGTTGATACC GGGAAGCCCT GGGCCAACTT TTGGCGAAAA 
AAGAGGTTCC AACTTTCACC ATAATGAAAT AAG AT C ACTA 
ATCGAGATTT TCAGGAGCTA AGGAAG CTAA AATGGAGAAA 
CGTTGATATA TCCCAATGGC ATCGTAAAGA ACATTTTGAG 
ATGT AC C TAT AAC C AG AC CG TTCAG CTGGA TATTACGGCC 
AAATAAGCAC AAGTTTTATC CGGCCTTTAT TCACATTCTT 
TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG 
TTGTTACACC GTTTTCCATG AGCAAACTGA AACGTTTTCA 
CGACGATTTC CGGCAGTTTC TACACATATA TT C G C AAG AT 
CCTGGCCTAT TTCCCTAAAG GGTTTATTGA GAATATGTTT 
GGTGAGTTTC AC C AGTTTTG ATTTAAACGT GGCCAATATG 
TTTCAC CATG GGCAAATATT ATACGCAAGG CGACAAGGTG 
GGTTCATCAT GCCGTCTGTG ATGGCTTCCA TGTCGGCAGA 
GTACTGCGAT GAGTGGCAGG GCGGGGCGTA AACGCGTGGA 
ATAACAGTAT GCGTATTTGC GCGCTGATTT TTGCGGTATA 
TACCCGAAGT ATGTCAAAAA GAGGTGTGCT ATGAAG C AG C 
AGCGACAGCT ATCAGTTGCT CAAGGCATAT ATGATGTCAA 
CAACCATGCA GAATGAAGCC CGTCGTCTGC GTGCCGAACG 
AAGGGATGGC TGAGGTCGCC CGGTTTATTG AAATGAACGG 
GGGACTGGTG AAATG C AGTT TAAGGTTTAC AC C TAT AAAA 
TTTGTGGATG TACAGAGTGA TATTATTGAC ACGCCCGGGC 
GCCAGTGCAC GTCTGCTGTC AGATAAAGTC TCCCGTGAAC 
GGGGATGAAA GCTGGCGCAT GATGACCACC GATATGGCCA 
GGGGAAGAAG TGGCTGATCT CAGCCACCGC GAAAATGACA 
ATGTTC TGGG GAATATAAAT GTCAGGCTCC GTTATACACA 
TAGTGACTGG ATATGTTGTG TTTTACAGTA TTATGTAGTC 
ATTTAATATA TTGATATTTA TATCATTTTA CGTTTCTCGT 
GGTGATAA 
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Figure 3 '/A- pDEST14 Native Protein Expression in E. coli, T7 

Promoter 



3961 



Asa 



tgccggccac gatgcgtccg gcgtagagga tcga featc tc gatcccgcga aaJtaatacg 
acggccggtg ctacgcaggc cgcatctcct agctctaciag ctagggcgct ttcteBtatgc 



4021 'actcactata 1 ggg agaccac aacggtttcc 



f tgagtgatat cccj tctggtg ttgccaaagg gagat 




a 



caagttt pta caaaa aaoct 
gttcaaacat gtttfcttcga . 



SgrA I 6381 



Sph I 6230 
EcoN 16165 
PshA I 6072 
Nru I 5816 



Sg/ II 1 
X6a I 59 
/VoM 192 



Bsg\ 5138 



7f/?1 1 1 I 4568 

Sap I 4435 
BspLU^^ 14313 




A/co I 955 
Mlu\ 1101 
BamH I 1107 
BssH II 1148 



SrfM522 
^Z. Smal 1523 

SsOCI 1635 
Sa/I 1782 
Bpi/1102 I 1967 

A//76I2196 



Hind III 2396 
C/a I 2402 
AatU 2502 



Eam1105 I 3420 





pDEST14 6422 bp (rotated to position 4000) 



Location (Base Nos . ) 
185 . . 61 
435 . . 1094 
1214 . . 1298 
1436 . . 1741 
1782 . . 1906 
2632 . . 3489 



Gene Encoded 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

ampR 



1 CGATCCCGCG AAATTAATAC GACTCACTAT 
61 ACAAGTTTGT ACAAAAAAGC TGAACGAGAA 
121 AATTAGATTT TGCATAAAAA ACAGACTACA 
181 CTATGGCGGC CGCTAAGTTG GC AG CAT C AC 
241 TGACGGAAGA TCACTTCGCA GAATAAATAA 

3 01 CCTGGGCCAA CTTTTGGCGA AAATG AG AC G 
361 AC C ATAATG A AATAAGATCA CTACCGGGCG 
421 CTAAGGAAGC TAAAATGGAG AAAAAAATCA 

4 81 GGCATCGTAA AGAACATTTT GAGGCATTTC 
541 CCGTTCAGCT GGATATTACG GCCTTTTTAA 
6 01 ATCCGGCCTT TATTCACATT CTTGCCCGCC 
661 CAATGAAAGA CGGTGAGCTG GTGATATGGG 
721 ATGAGCAAAC TGAAACGTTT TCATCGCTCT 
781 TTCTACACAT ATATTCGCAA GATGTGGCGT 
841 AAGGGTTTAT TGAGAATATG TTTTTCGTCT 
901 TTGATTTAAA CGTGGCCAAT ATGGACAACT 
961 ATTAT AC G C A AGGCGACAAG GTGCTGATGC 

1021 GTGATGGCTT CCATGTCGGC AGAATGCTTA 
1081 AGGGCGGGGC GTAAACGCGT GGATCCGGCT 
1141 TGCGCGCTGA TTTTTGCGGT ATAAGAATAT 
12 01 AAAGAGGTGT G CTATGAAG C AGCGTATTAC 

12 61 GCTCAAGGCA TATATGATGT CAATATCTCC 
1321 GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA 

13 81 GCCCGGTTTA TTGAAATGAA CGGCTCTTTT 

14 41 GTTTAAGGTT TACACCTATA AAAGAGAGAG 

15 01 TGATATTATT GACACGCCCG GGCGACGGAT 
1561 GTCAGATAAA GTCTCCCGTG AACTTTACCC 
1621 CATGATGACC AC CGATATGG CCAGTGTGCC 

16 81 TCTCAGCCAC CGCGAAAATG AC AT C AAAAA 
1741 AATGTCAGGC TCCCTTATAC AC AG C C AGTC 
18 01 GTGTTTTACA GTATTATGTA GTCTGTTTTT 
18 61 TTATATCATT TTAC GTTTCT CGTTCAGCTT 
1921 TAACAAAGCC CGAAAGGAAG CTGAGTTGGC 
,1981 ACCCCTTGGG GCCTCTAAAC GGGTCTTGAG 
2041 CGGATATCCA CAGGACGGGT GTGGTCGCCA 
2101 GCGAAGCGAG CAGGACTGGG CGGCGGCCAA 
2161 CGCATAGAAA TTGCATCAAC GCATATAGCG 
2 2 21^ TGTCGGAATG G AC GAT AT C C CGCAAGAGGC 
22 81 CTACAGCATC CAGGGTGACG GTGCCGAGGA 
2 341 ACGGTGCCTG ACTGCGTTAG CAATTTAACT 
24 01 TGATAAGCTG TCAAACATGA GAATTCTTGA 
24 61 TTATAGGTTA ATGTCATGAT AATAATGGTT 
2 521 AATGTGCGCG GAACCCCTAT TTGTTTATTT 
2581 ATGAGACAAT AACCCTGATA AATGCTTCAA 
2 641 CAACATTTCC GTGTCGCCCT TATTCCCTTT 
2701 C AC C C AG AAA CGCTGGTGAA AG T AAAAG AT 



AGGGAGACCA CAACGGTTTC CCTCTAGATC 
AC GTAAAATG ATATAAATAT CAATATATTA 
TAATACTGTA AAACACAACA TATC CAGTCA 
CCGACGCACT TTGCGCCGAA TAAATAC CTG 
ATCCTGGTGT C C CTGTTGAT ACCGGGAAGC 
TTGATCGGCA CGTAAGAGGT TCCAACTTTC 
TATTTTTTGA GTTATCGAGA TTTTCAGGAG 
CTGGATATAC CACCGTTGAT ATATCCCAAT 
AGTCAGTTGC TCAATGTACC TATAACCAGA 
AGACCGTAAA GAAAAATAAG CACAAGTTTT 
TGATGAATGC TCATCCGGAA TTCCGTATGG 
ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC 
GGAGTGAATA CCACGACGAT TTCCGGCAGT 
GTTACGGTGA AAACCTGGCC TATTTCCCTA 
CAGCCAATCC CTGGGTGAGT TTCACCAGTT 
TCTTCGCCCC CGTTTTCACC ATGGGCAAAT 
CGCTGGCGAT TCAGGTTCAT CATGCCGTCT 
ATGAATTACA ACAGTACTGC GATGAGTGGC 
TACTAAAAGC CAGATAACAG TATGCGTATT 
ATACTGATAT GT AT AC C C G A AGTATGTCAA 
AGTGACAGTT GACAGCGACA GCTATCAGTT 
GGTCTGGTAA G C AC AAC CAT GCAGAATGAA 
GCGGAAAATC AGGAAGGGAT GGCTGAGGTC 
GCTGACGAGA ACAGGGACTG GTGAAATGCA 
CCGTTATCGT CTGTTTGTGG ATGTACAGAG 
GGTGATCCCC CTGGCCAGTG CACGTCTGCT 
GGTGGTGCAT ATCGGGGATG AAAGCTGGCG 
GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA 
CGCCATTAAC CTGATGTTCT GGGGAATATA 
TGCAGGTCGA CCATAGTGAC TGGATATGTT 
TATGCAAAAT CTAATTTAAT ATATTGATAT 
TCTTGTACAA AGTGGTGATG ATCCGGCTGC 
TGCTGCCACC GCTGAGCAAT AACTAGCATA 
GGGTTTTTTG CTGAAAGGAG GAACTATATC 
TG AT C G C GT A GTCGATAGTG GCTCCAAGTA 
AGCGGTCGGA CAGTGCTCCG AGAACGGGTG 
CTAGCAGCAC GCCATAGTGA CTGGCGATGC 
CCGGCAGTAC CGGCATAACC AAGCCTATGC 
TGACGATGAG CGCATTGTTA G ATTT CAT AC 
GTGATAAACT AC C G C ATT AA AGCTTATCGA 
AGACGAAAGG GCCTCGTGAT ACGCCTATTT 
TCTTAGACGT CAGGTGGCAC TTTTCGGGGA 
TTCTAAATAC ATTCAAATAT GTATCCGCTC 
TAATATTGAA AAAGGAAGAG TATGAGTATT 
TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT 
GCTGAAGATC AGTTGGGTGC ACGAGTGGGT - 





27 61 TACATCGAAC TGGATCTCAA CAGCGGTAAG ATC CTTGAGA GTTTTCGCCC CGAAGAACGT 
2821 TTTCCAATGA TGAGCACTTT TAAAGTT CTG CTATGTGGCG CGGTATTATC CCGTGTTGAC 
2881 GCCGGGCAAG AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 

2 941 TCACCAGTCA C AG AAAAG C A TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT 

3 001 GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG 
3 061 AAGGAGCTAA CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG 
3121 GAACCGGAGC TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGCAGCA 
3181 ATGGCAACAA CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA 
3241 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT 
3 3 01 CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTC TCGCGGTATC 
3361 ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC TCCCGTATCG TAGTTATCTA CACGACGGGG 
3421 AGTCAGGCAA CTATGGATGA ACGAAATAGA CAGATCGCTG AG ATAGGTG C CTCACTGATT 
3481 AAGCATTGGT AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT 
3 541 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT G AC C AAAATC 
3 6 01 CCTTAACGTG AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT 
3 6 61 TCTTGAGATC CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA 
3 721 CCAGCGGTGG TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC 
3 781 TTCAGCAGAG CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT AGGCCACCAC 
3 841 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATC CTGTT ACCAGTGGCT 
3 901 GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT CAAGACGATA GTTACCGGAT 

3 961 AAGGCGCAGC GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCCAGCTT GGAG CGAACG 

4 021 ACCTACACCG AACTGAGATA CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA 
4 081 GGGAGAAAGG CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG 
4141 GAGCTTCCAG GGGGAAACGC CTGGTATCTT TATAGT C CTG TCGGGTTTCG CCACCTCTGA 
4 2 01 CTTGAGCGTC GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC 
42 61 AACGCGGCCT TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTG CTCACAT GTTCTTTCCT 
4 321 GCGTTATCCC CTGATTCTGT GGATAACCGT ATTACCGCCT TTG AGTG AG C TGATACCGCT 
4 381 CGCCGCAGCC GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG 
4 441 ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC AC CGCATATA TGGTGCACTC 
4501 TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGTA TACACTCCGC TATCGCTACG 
4 561 TGACTGGGTC ATGGCTGCGC CCCGACACCC GCCAACACCC GCTGACGCGC CCTGACGGGC 
4 621 TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC GTCTCCGGGA GCTGCATGTG 
4681 TCAGAGGTTT TC AC CGTCAT C AC CGAAACG CGCGAGGCAG CTGCGGTAAA GCTCATCAGC 
4 741 GTGGTCGTGA AGCGATTCAC AGATGTCTGC CTGTTCATCC GCGTCCAGCT CGTTGAGTTT 
4 8 01 CTCCAGAAGC GTTAATGTCT GGCTTCTGAT AAAGCGGGCC ATGTTAAGGG CGGTTTTTTC 
4861 CTGTTTGGTC ACTGATGCCT CCGTGTAAGG GGGATTTCTG TTCATGGGGG T AATG AT AC C 
4 921 GATGAAACGA GAGAGGATGC TCACGATACG GGTTACTGAT GATGAACATG CCCGGTTACT 

4 981 GGAACGTTGT GAGGGTAAAC AACTGGCGGT ATGGATGCGG CGGGACCAGA GAAAAATCAC 
5041 TCAGGGTCAA TGCCAGCGCT TC GTTAATAC AGATGTAGGT GTTCCACAGG G TAG C C AG C A 
5101 GCATCCTGCG ATGCAGATCC GGAACATAAT GGTGCAGGGC GCTGACTTCC GCGTTTCCAG 
5161 ACTTTACGAA ACACGGAAAC CGAAGACCAT TCATGTTGTT GCTCAGGTCG CAGACGTTTT 
5221 GCAGCAGCAG TCGCTTCACG TTCGCTCGCG TATCGGTGAT TCATTCTGCT AACCAGTAAG 
5281 GCAACCCCGC CAGCCTAGCC GGGTCCTCAA CGACAGGAGC ACGATCATGC GCACCCGTGG 
5341 CCAGGACCCA ACGCTGCCCG AGATGCGCCG CGTGCGGCTG CTGGAGATGG CGGACGCGAT 
54 01 GGATATGTTC TGCCAAGGGT TGGTTTGCGC ATTCACAGTT CTCCGCAAGA ATTGATTGGC 
5'4 61 TCCAATTCTT GGAGTGGTGA ATCCGTTAGC GAGGTGCCGC CGGCTTCCAT TCAGGTCGAG 
5521 GTGGCCCGGC TCCATGCACC GCGACGCAAC GCGGGGAGGC AGACAAGGTA TAGGGCGGCG 
5581 CCTACAATCC ATGCCAACCC GTTCCATGTG CTCGCCGAGG CGGCATAAAT CGCCGTGACG 

5 641- ATCAGCGGTC CAGTGATCGA AGTTAGGCTG GTAAGAGCCG CGAGCGATCC TTGAAGCTGT 
5701 CCCTGATGGT CGTCATCTAC CTGCCTGGAC AGCATGGCCT GCAACGCGGG CATCCCGATG 
5761 CCGCCGGAAG CGAGAAGAAT CATAATGGGG AAGGCCATCC AGCCTCGCGT CGCGAACGCC 
5821 AGCAAGACGT AGCCCAGCGC GTCGGCCGCC ATGCCGGCGA TAATGGCCTG CTTCTCGCCG 
58 81 AAACGTTTGG TGGCGGGACC AGTGACGAAG GCTTGAGCGA GGGCGTGCAA GATTCCGAAT 

5 941 ACCGCAAGCG ACAGGCCGAT CATCGTCGCG CTCCAGCGAA AGCGGTCCTC GCCGAAAATG 

6 001 AC C C AG AG C G CTGCCGGCAC CTGTCCTACG AGTTGCATGA TAAAGAAGAC AGTCATAAGT 
6 061 GCGGCGACGA TAGTCATGCC CCGCGCCCAC CGGAAGGAGC TGACTGGGTT GAAGGCTCTC 
6121 AAGGGCATCG GT C GAT C G AC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AG C AG C C C AG 
6181 TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC - 





62 41 GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT 

6 3 01 GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC 

6361 AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GG AT CGAGAT 
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Aafll3097 



pDESTIS 7013 bp 



Location (Base Nos . ) 
108 . .776 
916 . .792 



Gene Encoded 



1025 . . 1537 
1804 . . 1888 
2026 . .2331 
2372 . .2496 
3233 . .4093 



GST 

attRl 

CmR 



inactivated ccdA 



ccdB 

attR2 

ampR 



1 ATCGAGATCT CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGAC C A CAACGGTTTC 
61 CCTCTAGAAA TAATTTTGTT TAAC TTTAAG AAGGAGATAT AC ATATGTC C CCTATACTAG 
121 GTTATTGGAA AATTAAGGGC CTTGTGCAAC CCACTCGACT TCTTTTGGAA TATCTTGAAG 
181 AAAAATATGA AGAGCATTTG TATGAGCGCG ATGAAGGTGA TAAATGGCGA AACAAAAAGT 
241 TTGAATTGGG TTTGGAGTTT CCCAATCTTC CTTATTATAT TGATGGTGAT GTTAAATTAA 
301 CACAGTCTAT GGC CATC ATA CGTTATATAG CTGACAAGCA CAACATGTTG GGTGGTTGTC 
361 CAAAAGAGCG TGCAGAGATT TCAATGCTTG AAGGAGCGGT TTTGGATATT AGATACGGTG 
421 TTTCGAGAAT TGCATATAGT AAAGACTTTG AAACTCTCAA AGTTGATTTT CTTAGCAAGC 
481 T AC CTGAAAT GCTGAAAATG TTCGAAGATC GTTTATGTCA TAAAACATAT TTAAATGGTG 
541 ATCATGTAAC CCATCCTGAC TTCATGTTGT ATGACGCTCT TGATGTTGTT TTATACATGG 
6 01 ACCCAATGTG C CTGGATGCG TTCCCAAAAT TAGTTTGTTT TAAAAAACGT ATTGAAGCTA 
661 TCCCACAAAT TGATAAGTAC TTGAAATCCA G C AAGT AT AT AGCATGGCCT TTGCAGGGCT 
721 GGCAAGCCAC GTTTGGTGGT GGCGACCATC CTCCAAAATC GGATCTGGTT CCGCGTCCAT 
781 GGTCGAATCA AACAAGTTTG TACAAAAAAG CTGAACGAGA AAC GTAAAAT GATATAAATA 
841 TCAATATATT AAATTAGATT TTGCATAAAA AAC AG AC T AC ATAATACTGT AAAACACAAC 
9 01 ATATC CAGTC ACTATGGCGG CCGCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCCGG 
961 CTCGTATAAT GTGTGGATTT TGAGTTAGGA TCCGTCGAGA TTTTCAGGAG CTAAGGAAGC 
1021 TAAAATGGAG AAAAAAATCA CTGGATATAC CACCGTTGAT ATATCCCAAT GGCATCGTAA 
1081 AGAACATTTT GAGGCATTTC AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT 
1141 GGATATTACG GCCTTTTTAA AGACCGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT 
12 01 TATTCACATT CTTGCCCGCC TGATGAATGC TCATC CGGAA TTC CGTATGG CAATGAAAGA 

12 61 CGGTGAGCTG GTGATATGGG ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAGCAAAC 
1321 TGAAACGTTT TCATCGCTCT GGAGTGAATA CCACGACGAT TTCCGGCAGT TTCTACACAT 

13 81 ATATTCGCAA GATGTGGCGT GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT 
1441 TGAGAATATG TTTTTCGTCT CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA 

15 01 CGTGGCCAAT ATGGACAACT TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA 
1561 AGGCGACAAG GTGCTGATGC CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT 
1621 CCATGTCGGC AGAATGCTTA ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC 

16 81 GTAAT CTAGA GGATCCGGCT TACTAAAAGC CAGATAACAG TATGCGTATT TGCGCGCTGA 
1741 TTTTTGCGGT ATAAGAATAT ATACTGATAT GTATAC C CG A AGTATGTCAA AAAGAGGTGT 
18 01 GCTATGAAGC AG C GTATTAC AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA 
18 61 TATATGATGT CAATATCTCC GGTCTGGTAA G C AC AAC CAT GCAGAATGAA GCCCGTCGTC 
1921 TGCGTGCCGA ACGCTGGAAA GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA 
1981 TTGAAATGAA CGGCTCTTTT GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT 
2 041 TACACCTATA AAAGAGAGAG CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT 
2101 GACACGCCCG GGCGACGGAT GGTGATCCCC CTGGCCAGTG CACGTCTGCT GTCAGATAAA 
2161 ^GTCTCCCGTG AACTTTACCC GGTGGTGCAT ATCGGGGATG AAAGCTGGCG C ATGATG AC C 
2 221 AC CGATATGG CCAGTGTGCC GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC 
22 81 CGCGAAAATG ACATCAAAAA CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC 
2 341 TCCCTTATAC ACAGC CAGTC TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA 
24 01 GTATTATGTA GTCTGTTTTT TATGCAAAAT CTAATTTAAT ATATTGATAT TTAT AT C ATT 
2461 TTACGTTTCT CGTTCAGCTT TCTTGTACAA AGTGGTTTGA TTCGACCCGG GATCCGGCTG 
2 521 CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAG C AT 
2 581 AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT 
2641 CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT - 



2701 AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT 
2 761 G C G C AT AG AA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG ACTGGCGATG 
2 821 CTGTCGGAAT GGACGATATC CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG 
2 881 CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA 

2 941 CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC TACCGCATTA AAGCTTATCG 

3 001 ATGATAAGCT GTCAAACATG AGAATT CTTG AAGACGAAAG GGCCTCGTGA TACGCCTATT 
3 061 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 
3121 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 
3181 CATGAGACAA TAAC C CTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
3241 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 
3 3 01 TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 
3361 TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG 
34 21 TTTTCCAATG ATGAG C ACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTGTTGA 
3481 CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA 
3 541 CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 
36 01 TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC 
3 661 GAAGGAGCTA ACCGCTTTTT TG C AC AAC AT GGGGGATCAT GTAACTCGCC TTGATCGTTG 
3721 GGAACCGGAG CTGAATGAAG C C ATAC C AAA CGACGAGCGT G AC AC C AC G A TGCCTGCAGC 
3 781 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 
3841 ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 
3 901 TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT 

3 961 CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG 

4 021 GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 
4 081 TAAGCATTGG TAACTGTCAG AC C AAGTTT A CTCATATATA CTTTAGATTG ATTTAAAACT 
4141 TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCT C A TGACCAAAAT 
42 01 CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC 
4 261 TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT 
4 321 ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 
4 381 CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA 
4441 CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
4 5 01 TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA 
4 561 TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 
4 621 GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA 
4681 AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 
4741 GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 
48 01 ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 
4 861 CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC 
4 921 TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 

4 981 TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCT 

5 041 GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA C AC C G CAT AT ATGGTG CACT 
5101 CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGT ATACACTCCG CTATCGCTAC 
5161 GTGACTGGGT CATGGCTGCG CCCCGACACC CGCCAACACC CGCTGACGCG CCCTGACGGG 
5221 CTTGTCTGCT CCCGGCATCC GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT 
52 81 GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGGCA GCTGCGGTAA AGCTCATCAG 
5341 CGTGGTCGTG AAGCGATTCA CAGATGTCTG CCTGTTCATC CGCGTCCAGC TCGTTGAGTT 

'5401 TCTCCAGAAG CGTTAATGTC TGGCTTCTGA TAAAGCGGGC CATGTTAAGG GCGGTTTTTT 
54 61 CCTGTTTGGT CACTGATGCC TCCGTGTAAG GGGGATTTCT GTTCATGGGG GTAATGATAC 
5521 CGATGAAACG AGAGAGGATG CTCACGATAC GGGTTACTGA TGATGAACAT GCCCGGTTAC 
5 581- TGGAACGTTG TGAGGGTAAA CAACTGGCGG TATGGATGCG GCGGGACCAG AGAAAAATCA 
5 641 CTCAGGGTCA ATGCCAGCGC TTCGTTAATA CAGATGTAGG TGTTCCACAG GGTAGCCAGC 
5701 AGCATCCTGC GATGCAGATC C GG AAC AT AA TGGTGCAGGG CGCTGACTTC CGCGTTTCCA 
5761 GACTTTACGA AACACGGAAA CCGAAGACCA TTCATGTTGT TGCTCAGGTC GCAGACGTTT 
5 821 TGCAGCAGCA GTCGCTTCAC GTTCGCTCGC GTATCGGTGA TTCATTCTGC TAAC C AGTAA 
5881 GGCAACCCCG CCAGCCTAGC CGGGTCCTCA AC GAC AGGAG CACGATCATG CGCACCCGTG 

5 941 GCCAGGACCC AACGCTGCCC GAGATGCGCC GCGTGCGGCT GCTGGAGATG GCGGACGCGA 

6 001 TGGATATGTT CTGCCAAGGG TTGGTTTGCG CATTCACAGT TCTCCGCAAG AATTGATTGG 
6 061 CTCCAATTCT TGGAGTGGTG AATCCGTTAG CGAGGTGCCG CCGGCTTCCA TTCAGGTCGA 
6121 GGTGGCCCGG CTCCATGCAC CGCGACGCAA CGCGGGGAGG CAGACAAGGT ATAGGGCGGC - 




6181 GCCTACAATC CATGCCAACC CGTTCCATGT GCTCGCCGAG GCGGCATAAA TCGCCGTGAC 
6241 GATCAGCGGT CCAGTGATCG AAGTTAGGCT GGTAAGAGCC GCGAGCGATC CTTGAAGCTG 
6 3 01 TCCCTGATGG TCGTCATCTA CCTGCCTGGA CAGCATGGCC TGCAACGCGG GCATCCCGAT 
6 361 GCCGCCGGAA GCGAGAAGAA TCATAATGGG GAAGGCCATC CAGCCTCGCG TCGCGAACGC 
6421 CAGCAAGACG TAGCCCAGCG CGTCGGCCGC CATGCCGGCG ATAATGGCCT GCTTCTCGCC 
64 81 GAAACGTTTG GTGG CGGGAC CAGTGACGAA GGCTTGAGCG AGGGCGTGCA AGATTCCGAA 
6 541 TACCGCAAGC GACAGGCCGA TCATCGTCGC GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT 
66 01 G AC C C AG AG C GCTGCCGGCA CCTGTCCTAC GAGTTG C ATG ATAAAGAAGA CAGTCATAAG 
66 61 TGCGGCGACG ATAGTC ATG C CCCGCGCCCA CCGGAAGGAG CTGACTGGGT TGAAGGCTCT 
6 721 CAAGGGCATC GGTCGATCGA CGCTCTCCCT TATGCGACTC CTGCATTAGG AAGCAGCCCA 
6781 GTAGTAGGTT GAGGCCGTTG AGCACCGCCG CCGCAAGGAA TGGTGCATGC AAGGAGATGG 
6841 CGCCCAACAG TCCCCCGGCC ACGGGGCCTG CCACCATACC CACGCCGAAA CAAGCGCTCA 
6 9 01 TGAGCCCGAA GTGGCGAGCC CGATCTTCCC CATCGGTGAT GTCGGCGATA TAGGCGCCAG 
6 961 CAACCGCACC TGTGGCGCCG GTGATGCCGG CCACGATGCG TCCGGCGTAG AGG 
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pDEST16 

Location (Base Nos . ) 
104 . .457 
585 . . 461 
694 . . 1353 
1473 . . 1557 
1695 . .2000 
2041 . .2165 




6675 bp 

Gene Encoded 

trxA 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 



1 AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG AG AC C AC AAC GGTTTCCCTC 
61 TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACAT ATGAGCGATA AAATTATTCA 
121 CCTGACTGAC G AC AG TTTTG ACACGGATGT ACTCAAAGCG GACGGGGCGA TCCTCGTCGA 
181 TTTCTGGGCA GAGTGGTGCG GTCCGTGCAA AATGATCGCC CCGATTCTGG ATGAAATCGC 
241 TGACGAATAT CAGGGCAAAC TGACCGTTGC AAAACTGAAC ATCGATCAAA ACCCTGGCAC 
3 01 TGCGCCGAAA TATGGCATCC GTGGTATCCC GACTCTGCTG CTGTTCAAAA ACGGTGAAGT 
361 GGCGGCAACC AAAGTGGGTG CACTGTCTAA AGGTCAGTTG AAAGAGTTCC TCGACGCTAA 
421 CCTGGCCGGT TCTGGTTCTG GTGATGACGA TGACAAGATC ACAAGTTTGT ACAAAAAAGC 
481 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA 
541 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC 
6 01 ACCCCAGGCT TT AC AC TTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT 
6 61 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGGATATACC 
721 ACCGTTGATA TATCCCAATG G CAT CGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT 
781 CAATGTACCT ATAAC C AG AC CQTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG 
841 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT 
9 01 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTT CAC 
961 C C TTGTTAC A CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC 
1021 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA 
1081 AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC 
1141 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA TGGACAACTT CTTCGCCCCC 
12 01 GTTTTCAC C A TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT 

12 61 CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA 
1321 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC 

13 81 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 
1441 TATACCCGAA GTATGTCAAA AAGAGGTGTG CTATGAAGCA GCGTATTACA GTGACAGTTG 
1501 ACAGCGACAG CTATCAGTTG CTCAAGGCAT ATATGATGTC AATATCTCCG GTCTGGTAAG 
1561 CACAACCATG CAGAATGAAG CCCGTCGTCT GCGTGCCGAA CGCTGGAAAG CGGAAAATCA 
1621 GGAAGGGATG GCTGAGGTCG CCCGGTTTAT TGAAATGAAC GGCTCTTTTG CTGACGAGAA 
16 81 CAGGGACTGG TGAAATGCAG TTTAAGGTTT ACAC CTATAA AAG AG AG AG C CGTTATCGTC 
1741 TGTTTGTGGA TGTACAGAGT GATATTATTG ACACGCCCGG GCGACGGATG GTGATCCCCC 
18 01 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA 
1861 TCGGGGATGA AAGCTGGCGC ATGATG AC C A CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1921 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC 
1981 TGATGTTCTG GGGAATATAA ATGTCAGGCT C C CTTATACA CAGCCAGTCT GCAGGTCGAC 
2 041 CATAGTGACT G G AT ATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC 
21 ;/ 01 TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA 
2161 GTGGTGATGA TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG 
2221 CTGAGCAATA AC TAG C ATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC 
22 81 TGAAAGGAGG AAC TATATC C GGATATC CAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG 
2341 TCGATAGTGG CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC 
24 01 AGTGCTCCGA GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG 
24 61 C C ATAGTGAC TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC 
2 521 GGCATAACCA AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC 
2 581 GCATTGTTAG ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA 
2 641 CCGCATTAAA GCTTATCGAT GATAAGCTGT CAAACATGAG AATTCTTGAA GACGAAAGGG 
2 701 CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC 
2 761 AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA- 



2 821 TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 

2881 AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 

2 941 TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 

3 001 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG 
3 061 TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 
3121 GGTATTATCC CGTGTTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA 
3181 GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
3241 AAGAGAATTA TGCAGTGCTG C C ATAAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 
3 3 01 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT 
3 361 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA 
3421 CACCACGATG CCTGCAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 
34 81 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
3541 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAAT CTG GAGCCGGTGA 
36 01 GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 
3661 AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 
3721 GATAGGTGCC TCACTGATTA AG C ATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 
3 781 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA 
3 841 TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 
3 901 AG AAAAG AT C AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA 

3 961 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 

4 021 TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA 
4 081 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 
4141 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 
42 01 AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 
42 61 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 
4321 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC G G AC AGGTAT CCGGTAAGCG GCAGGGTCGG 
4381 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 
4441 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 
4 5 01 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 
4 561 TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT 
4 621 TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA 
4 681 GGAAGCGGAA GAGCGCCTGA TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA 
4 741 CCGCATATAT GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT 
48 01 ACACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG 
4861 CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 
4 921 TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC 

4 981 TGCGGTAAAG CTCATCAGCG TGGTCGTGAA G CGATTCAC A GATGTCTGCC TGTTCATCCG 

5 041 CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG TTAATGTCTG G CTTCTG ATA AAGCGGGCCA 
5101 TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA CTGATGCCTC CGTGTAAGGG GGATTTCTGT 
5161 TCATGGGGGT AATGATACCG ATG AAAC GAG AGAGGATGCT CACGATACGG GTTACTGATG 
5221 ATG AAC ATG C CCGGTTACTG GAACGTTGTG AGGGTAAACA ACTGGCGGTA TGGATGCGGC 
5281 GGGACCAGAG AAAAATCACT CAGGGTCAAT GCCAGCGCTT CGTTAATACA GATGTAGGTG 
5341 TTC CACAGGG TAG C C AGCAG CATCCTGCGA TGCAGATCCG GAACATAATG GTGCAGGGCG 
54 01 CTGACTTCCG CGTTTCCAGA CTTTACGAAA CACGGAAACC GAAGACCATT CATGTTGTTG 
54 61 CTCAGGTCGC AGACGTTTTG CAGCAGCAGT CGCTTCACGT TCGCTCGCGT ATCGGTGATT 

/ 5'521 CATTCTGCTA ACCAGTAAGG CAACCCCGCC AGCCTAGCCG GGTCCTCAAC GACAGGAGCA 

5581 CGATCATGCG CACCCGTGGC CAGGACCCAA CGCTGCCCGA GATGCGCCGC GTGCGGCTGC 

5641 TGGAGATGGC GGACGCGATG GATATGTTCT GCCAAGGGTT GGTTTGCGCA TTCACAGTTC 

5701, TCCGCAAGAA TTGATTGGCT CCAATTCTTG GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC 

5761 'GGCTTCCATT CAGGTCGAGG TGGCCCGGCT CCATGCACCG CGACGCAACG CGGGGAGGCA 

5821 GACAAGGTAT AGGGCGGCGC C TACAATC C A TGCCAACCCG TTCCATGTGC TCGCCGAGGC 

58 81 GGCATAAATC GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC 

5 941 GAGCGATCCT TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG 

6 001 CAACGCGGGC ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA 
6 061 GCCTCGCGTC GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT 
6121 AATGGCCTGC TTCTCGCCGA AAC GTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG 
6181 GGCGTGCAAG ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA 
6241 GCGGTCCTCG CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT - 



6 301 AAAGAAGACA GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT 

6 361 GACTGGGTTG AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT 

6 421 GCATTAGGAA GCAGCCCAGT AGTAGGTTGA GGCCGTTGAG CACCGCCGCC GCAAGGAATG 

64 81 GTGCATGCAA GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA 

6 541 CGCCGAAACA AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT 

66 01 CGGCGATATA GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC 
6 661 CGGCGTAGAG GATCG 
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Bsg I 5070 



SgrA 16313 

Sph\ 6162 

EcoN I 6097 
PshA I 6004 

Nru I 5748 \ 



Xba I 59 

Nde I 98 

Nof I 265 



TtM 1 1 I 4500 
BsaA I 4495 
Sap I 4367 

BspLU11 I 4245 

>4fl II 4245 




Nco I 887 

BssH II 1080 

S/f I 1454 
Sma\ 1455 
BsfX 11 567 

Sa/I 1714 
Spu1102 I 1899 

Mel 2128 

H/nd III 2328 
C/a I 2334 
Aat II 2434 



Eam1105 I 3352 





pDEST17 6354 bp 



Tinea f. ion (Base Nos.) Gene Encoded 

258 . . 134 
367 . . 1026 
1146 . .1230 
1368 . . 1673 
1714 . . 1838 
2564 . . 3421 

1 CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGAAA 
61 TAATTTTGTT TAACTTTAAG AAGGAGATAT ACATATGTCG TACTACCATC ACCATCACCA 
121 TCACCTCGAA TCAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA 
181 TATCAATATA TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA 
241 AC AT AT C C AG TCACTATGGC GGCCGCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 
3 01 GGCTCGTATA ATGTGTGGAT TTTGAGTTAG GATCCGTCGA GATTTTCAGG AG CTAAGG AA 
361 GCTAAAATGG AGAAAAAAAT CACTGGATAT ACCACCGTTG ATATATCCCA ATGGCATCGT 
421 AAAGAACATT TTGAGGCATT TCAGTCAGTT GCTCAATGTA CCTATAACCA GACCGTTCAG 
481 CTGGATATTA CGGCCTTTTT AAAGACCGTA AAGAAAAATA AGCACAAGTT TTATCCGGCC 
541 TTTATTCACA TTCTTGCCCG CCTGATGAAT GCTCATCCGG AATTCCGTAT GGCAATGAAA 
6 01 GACGGTGAGC TGGTGATATG GGATAGTGTT CACCCTTGTT AC AC CGTTTT CCATGAGCAA 
661 ACTGAAACGT TTTCATCGCT CTGGAGTGAA TACCACGACG ATTTCCGGCA GTTTCTACAC 
721 ATATATTCGC AAGATGTGGC GTGTTACGGT G AAAAC CTGG CCTATTTCCC TAAAGGGTTT 
781 ATTGAGAATA TGTTTTTCGT CTCAGCCAAT CCCTGGGTGA GTTTCACCAG TTTTGATTTA 
841 AACGTGGCCA ATATGGACAA CTTCTTCGCC CCCGTTTTCA CCATGGGCAA ATATTATACG 
901 CAAGGCGACA AGGTGCTGAT GCCGCTGGCG ATTC AGGTT C ATCATGCCGT CTGTGATGGC 
961 TTCCATGTCG GCAGAATGCT TAATGAATTA CAACAGTACT GCGATGAGTG GCAGGGCGGG 
1021 GCGTAAAGAT CTGGATCCGG CTTACTAAAA GCCAGATAAC AGTATGCGTA TTTGCGCGCT 
1081 GATTTTTGCG GTATAAGAAT ATATACTGAT ATGTATACCC GAAGTATGTC AAAAAGAGGT 
1141 GTGCTATGAA GCAGCGTATT ACAGTGACAG TTG AC AG C G A CAGCTATCAG TTGCTCAAGG 

12 01 CATATATGAT GTCAATATCT CCGGTCTGGT AAGCACAACC ATGCAGAATG AAGCCCGTCG 
1261 TCTGCGTGCC GAACGCTGGA AAGCGGAAAA TCAGGAAGGG ATGGCTGAGG TCGCCCGGTT 
1321 TATTGAAATG AACGGCTCTT TTGCTGACGA GAACAGGGAC TGGTGAAATG CAGTTTAAGG 

13 81 TTTAC AC CTA TAAAAGAGAG AGCCGTTATC GTCTGTTTGT GGATGTACAG AGTGATATTA 

14 41 TTGACACGCC CGGGCGACGG ATGGTGATCC CCCTGGCCAG TGCACGTCTG CTGTCAGATA 

15 01 AAGTCTCCCG TGAACTTTAC CCGGTGGTGC ATATCGGGGA TGAAAGCTGG CGCATGATGA 
1561 CCACCGATAT GGCCAGTGTG CCGGTCTCCG TTATCGGGGA AG AAGTGG CT GATCTCAGCC 
1621 ACCGCGAAAA TGACATCAAA AACGCCATTA ACCTGATGTT CTGGGGAATA TAAATGTCAG 

16 81 GCTCCCTTAT AC AC AG C C AG TCTGCAGGTC GACCATAGTG ACTGGATATG TTGTGTTTTA 
1741 CAGTATTATG TAGTCTGTTT TTTATG C AAA ATCTAATTTA ATATATTGAT ATTT AT AT C A 
18 01 TTTTACGTTT CTCGTTCAGC TTTCTTGTAC AAAGTGGTTG ATTCGAGGCT GCTAACAAAG 
18 61 CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG 
1921 GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC 

'igSl CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG 
2 041 AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA 
2101 AATTG C ATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA 
216r-TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA 
2221 TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC 
22 81 TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC GATGATAAGC 
2 341 TGTCAAACAT GAGAATTCTT G AAG AC G AAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT 
24 01 TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG 
24 61 CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA 
2521 ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT 
2 581 CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA 
2641 AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTAC ATCG A - 



attRl 
CmR 

inactivated ccdA 

ccdB 

attR2 

ampR 



27 01 ACTGGATCTC AACAGCGGTA AGATC CTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT 
2761 GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA 
2821 AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT 

28 81 CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC 

2 941 CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT 

3 0 01 AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA 
3 061 GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC 
3121 AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT 
3181 AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG 
3241 CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC 
3 3 01 ACTGGGGCCA GAT GGTAAG C CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC 
3361 AACTATGGAT G AAC GAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG 
3421 GTAACTGTCA G AC C AAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA 
34 81 ATTTAAAAGG AT C TAGGTG A AGATC CTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG 
3 541 TGAGTTTTCG TTCCACTGAG CGTCAGACCC C GTAGAAAAG AT C AAAGGAT CTTCTTGAGA 
3 6 01 TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT 
3661 GGTTTGTTTG CCGGATCAAG AGCTAC C AAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG 
3 721 AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA 
3 781 CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG 
3 841 TGG CGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA 
3 901 GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC 

3 961 CGAACTGAGA TAC C T AC AG C GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA 

4 021 GGCGGACAGG TATCCGGTAA GCGGCAGGGT C G G AAC AG G A GAGCGCACGA GGGAGCTTCC 
4 081 AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
4141 TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC 
42 01 CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC 

42 61 CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG 
4321 CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA 

43 81 TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC TCTCAGTACA 
4441 ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG 
4501 TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC 
4 561 TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT 
4 621 TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT 
4 681 GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA 
4 741 GCGTTAATGT CTGGCTTCTG ATAAAG CGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG 
48 01 TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC 
4861 GAGAGAGGAT G C T C AC G ATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT 
4 921 GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC 

4 981 AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG 

5 041 C G ATG C AG AT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG 
5101 AAACACGGAA AC CGAAGAC C ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC 
5161 AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC 
52 21 GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC 
52 81 CAACGCTGCC C GAG ATG C G C CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT 
,5341 TCTGCCAAGG GTTGGTTTGC G C ATT C AC AG TTCTCCGCAA GAATTGATTG GCTCCAATTC 
'54 01 TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG 

54 61 GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG CGCCTACAAT 
5521 CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATCAGCGG 
5581* TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG 
5641 GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA 

57 01 AGCGAGAAGA AT C ATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC 
5761 GTAGCCCAGC GCGTCGGCCG CCATGCCGGC G ATAATGG C C TGCTTCTCGC CGAAACGTTT 
5821 GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA ATACCGCAAG 

58 81 CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TG AC C C AG AG 
5941, CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGT CATAA GTGCGGCGAC 

6 001 GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT 
6 061 CGGTCGATCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC AGTAGTAGGT 
6121 TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA- 




6181 GTCCCCCGGC CACGGGGCCT GCCACCATAC 
6241 AGTGGCGAGC CCGATCTTCC CCATCGGTGA 
63 01 CTGTGGCGCC GGTGATGCCG GCCACGATGC 



CCACGCCGAA ACAAGCGCTC ATGAGCCCGA 
TGTCGGCGAT ATAGGCGCCA GCAACCGCAC 
GTCCGGCGTA GAGGATCGAG ATCT 
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6 01 


CATTTCCGTG 


TGGGGGTTA1 




6 61 


C C AG AAAC G C 


i-p/~i prpp 7V TV TV rrp 

TGGTGAAAG 1 




721 


AT C G AAC TG G 


TV rp/^irp/~1 7\ 7\ fl 71 f* 

AT G T G AAG AG 


5a: 


781 


C CAATGATGA 


/-1/*l7i /"II 1 II 1 It 1 It 1 I TV TV 

GGAGTTTTAA 




841 


GGGCAAGAGC 


AACTCGGTCG 




901 


CCAGTCACAG 


TV TV TV TV -y» m 1 1 1 

AAAAGCATCT 


J ! 1 


961 


ATAACCATGA 


rrt v**4 tv m tv tv tv 

GTGATAACAC 




1021 


GAGCTAACCG 


CTTTTTTGCA 


I ; 


1081 


CCGGAGCTGA 


TV FT 1 X" "4 TV TV ^4 y — < m m. 1 f 1 

ATG AAG C CAT 


jfl 1 


1141 


GCAACAACGT 


TGCGCAAACT 




1201 


TTAATAGACT 


GGATGGAGGC 




1261 


GCTGGCTGGT 


TTATTGCTGA 




1321 


GCAGCACTGG 


GGC CAGATGG 




1381 


CAGGCAACTA 


TGGATGAACG 




1441 


CATTGGTAAC 


TGTCAGACCA 




1501 


TTTTAATTTA 


AAAGGATCTA 




1561 


TAAC GTGAGT 


TTTCGTTCCA 




1621 


TGAGATCCTT 


TTTTTCTGCG 




1681 


GCGGTGGTTT 


GTTTGCCGGA 




,1741 


AGCAGAGCGC 


AGATAC C AAA 




1801 


AAGAACTCTG 


TAGCACCGCC 




1861 


GCCAGTGGCG 


ATAAGTCGTG 




1921 


GCGCAGCGGT 


CGGGCTGAAC 




1981* 


. TACACCGAAC 


TG AGATAC CT 




2041 


AGAAAGGCGG 


AC AGGTATC C 




2101 


CTTCCAGGGG 


GAAACGCCTG 




2161 


GAGCGTCGAT 


TTTTGTGATG 




2221 


GCGGCCTTTT 


TACGGTTCCT 




2281 


TTATCCCCTG 


ATTCTGTGGA 




2341 


CGCAGCCGAA 


CGACCGAGCG 




2401 


CGGTATTTTC 


TCCTTACGCA 




2461 


GGCAAAATCG 


GTTACGGTTG 



ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC 
AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC 
TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT 
CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG 
TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT 
AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA 
GGCCTATTGG TTAAAAAATG AG C TG ATTTA ACAAAAATTT 
ATTAACGTTT AC AATTT C AG GTGGCACTTT TCGGGGAAAT 
TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG 
GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAG T ATT C AA 
TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC 
AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 
CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AG AAC GTTTT 
AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC 
CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA 
TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC 
TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGAC CGAAG 
CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA 
ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 
ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA 
GGATAAAGTT G C AGGAC C AC TTCTGCGCTC GGCCCTTCCG 
TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT 
TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT 
AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG 
AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT 
GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT 
CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT 
CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA 
TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC 
TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC 
TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT 
TCTTACCGGG TTGGACTCAA GACGATAGTT AC CGGATAAG 
GGGGGGTTCG TG C AC AC AG C CCAGCTTGGA GCGAACGACC 
ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG 
GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG 
GTATC TTTAT AGTCCTGTCG GGTTTCGCCA C CTCTG ACTT 
CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC 
GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG 
TAAC C GTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC 
CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG 
TCTGTGCGGT ATTTCACACC G C AG AC C AG C CGCGTAACCT 
AGTAATAAAT GGATGCCCTG CGTAAGCGGG TGTGGGCGGA- 




2 521 CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG ACAATAAAGT CTTAAACTAG 
2 581 ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT G AAAAAG C AT ACTGGACTTT 
2 641 TGTTATGGCT AAAGCAAACT CTTCATTTTC TGAAGTGCAA ATTGCCCGTC GTATTAAAGA 
27 01 GGGGCGTGGC CAAGGGCATG GTAAAGACTA TATTCGCGGC GTTGTGACAA TTTACCGAAC 
2 761 AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT GTTAGGTGGC GGTACTTGGG 
2 821 TCGATATCAA AGTGCATCAC TTCTTCCCGT ATGCCCAACT TTGTATAGAG AGCCACTGCG 
2881 GGATCGTCAC CGTAATCTGC TTGCACGTAG ATCACATAAG CACCAAGCGC GTTGGCCTCA 

2 941 TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC TCCGGTGCTC GCCGGAGACT 

3 001 G C GAG AT CAT AGATATAGAT CTCACTACGC GGCTGCTCAA ACCTGGGCAG AACGTAAGCC 
3 061 GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA GCGCGATGAA TGTCTTACTA 
3121 CGGAGCAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT GGGAGTAGGT GGCTACGTCT 
3181 CCGAACTCAC GACCGAAAAG ATCAAGAGCA GCCCGCATGG ATTTGACTTG GTCAGGGCCG 
3 241 AGCCTACATG TGCGAATGAT GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG 
3 3 01 CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA 
3 3 61 TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA 
3421 ACAGTCATAA CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA 
34 81 GGTTCTGGAC CAGTTGCGTG AGCGCATACG CTAC TTGC AT TACAGTTTAC GAAC CGAACA 
3 541 GGCTTATGTC AACTGGGTTC GTGC CTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC 
3 6 01 CTTGGGCAGC AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC 
3 661 GGTCTCCACG CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG 
3 721 CACGGATCTG CCCTGGCTTC AGGAGATCGG AAGAC CTCGG CCGTCGCGGC GCTTGCCGGT 
3 781 GGTGCTGACC CCGGATGAAG TGGTTCGCAT CCTCGGTTTT CTGGAAGGCG AG CAT C GTTT 
3 841 GTTCGCCCAG GACTCTAGCT ATAGTTCTAG TGGTTGGCTA CGTATCGAGC AAGAAAATAA 
3 9 01 AACGCCAAAC GCGTTGGAGT CTTGTGTGCT ATTTTTACAA AG ATT C AG AA ATACG CATC A 

3 961 CTTACAACAA GGGGGACTAT GAAATTATGC ATTTTGAGGA TGCCGGGACC TTTAATTCAA 

4 021 CCCAACACAA TATATTATAG TTAAATAAGA ATTATTTATC AAATCATTTG TATATTAATT 
4 081 AAAATACTAT AC TGTAAATT ACATTTTATT TACAATGAGG ATCATCACAA GTTTGTACAA 
4141 AAAAGCTGAA CGAGAAACGT AAAATGATAT AAATATCAAT ATATTAAATT AG ATTTTG C A 
42 01 TAAAAAACAG AC TAC ATAAT ACTGTAAAAC ACAACATATC CAGTCACTAT GGCGGCCGCT 
42 61 AAGTTGGCAG CATCACCCGA CGCACTTTGC G C C G AAT AAA TACCTGTGAC GGAAGATCAC 
4321 TTCGCAGAAT AAATAAATCC TGGTGTCCCT GTTGATACCG GGAAGCCCTG GGCCAACTTT 
4381 TGGCGAAAAT GAGACGTTGA TCGGCACGTA AGAGGTTCCA ACTTTCACCA TAATGAAATA 
4441 AGATCACTAC CGGGCGTATT TTTTGAGTTA TCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
4 501 ATGGAGAAAA AAATCACTGG AT AT AC C AC C G TTG AT AT AT CCCAATGGCA TCGTAAAGAA 
4 561 CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTAC CTATA AC C AG AC C GT TCAGCTGGAT 
4 621 ATTACGGCCT TTTTAAAGAC C GT AAAG AAA AATAAGCACA AGTTTTATCC GGCCTTTATT 
4681 CACATTCTTG CCCGCCTGAT GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT 
4741 GAGCTGGTGA TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA 
4 8 01 ACGTTTTCAT CGCTCTGGAG TG AAT AC C AC GACGATTTCC GGCAGTTTCT ACACATATAT 
4861 TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGC CTATT TCCCTAAAGG GTTTATTGAG 
4 921 AATATGTTTT TCGTCTCAGC CAATCCCTGG GTGAGTTTCA CCAGTTTTGA TTTAAACGTG 

4 981 GCCAATATGG ACAACTTCTT CGCCCCCGTT TTCACCATGG G CAAATATTA TACGCAAGGC 
5041 GACAAGGTGC TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 
5101 GTCGGCAGAA TG CTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG CGGGGCGTAA 
5161 ACGCGTGGAT CCGGCTTACT AAAAGCCAGA TAACAGTATG CGTATTTGCG CGCTGATTTT 

's'221 TGCGGTATAA GAATATATAC TGATATGTAT ACCCGAAGTA TGT C AAAAAG AGGTGTGCTA 
52 81 TGAAGCAGCG TATTACAGTG ACAGTTGACA GCGACAGCTA TCAGTTGCTC AAGGCATATA 

5 341 TGATGTCAAT ATCTCCGGTC TGGTAAG C AC AACCATGCAG AATGAAGCCC GTCGTCTGCG 
54 01*.TGCCGAACGC TGGAAAGCGG AAAATCAGGA AGGGATGGCT GAGGTCGCCC GGTTTATTGA 
54 61 AATGAACGGC TCTTTTGCTG AC G AG AAC AG GGACTGGTGA AATGCAGTTT AAGGTTTACA 
5 521 CCTATAAAAG AGAGAGCCGT TATCGTCTGT TTGTGGATGT ACAGAGTGAT ATTATTGACA 
5581 CGCCCGGGCG ACGGATGGTG ATCCCCCTGG CCAGTGCACG TCTGCTGTCA GATAAAGTCT 
5641 CCCGTGAACT TTACCCGGTG GTGCATATCG GGGATGAAAG CTGGCGCATG ATGACCACCG 

57 01 ATATGG C C AG TGTGCCGGTC TCCGTTATCG GGGAAGAAGT GGCTGATCTC AGCCACCGCG 
5761 AAAATGACAT CAAAAACGCC ATTAAC CTGA TGTTCTGGGG AATATAAATG TCAGGCTCCC 
5821 TTATACACAG CCAGTCTGCA GGTCGACCAT AGTGACTGGA TATGTTGTGT TTTAC AG TAT 

58 81 TATGTAGTCT GTTTTTTATG CAAAATCTAA TTTAATATAT TGATATTTAT ATCATTTTAC 
5 941 GTTTCTCGTT CAGCTTTCTT GTACAAAGTG GTGATAG CTT GTCGAGAAGT ACTAGAGGAT- 




6 001 C ATAATC AG C CAT AC C AC AT TTGTAGAGGT TTTACTTGCT TTAAAAAACC TCCCACACCT 
6 061 CCCCCTGAAC CTGAAACATA AAATGAATGC AATTGTTGTT GTTAACTTGT TTATTGCAGC 
6121 TTATAATGGT TACAAATAAA GC AATAG CAT CACAAATTTC ACAAATAAAG CATTTTTTTC 
6181 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG TCTGGATCTG 
6241 ATCACTGCTT GAG C CTAGGA GATCCGAACC AGATAAGTGA AATCTAGTTC CAAACTATTT 
6301 TGTCATTTTT AATTTTCGTA TTAGCTTACG ACGCTACACC C AGTTC C CAT CTATTTTGTC 
6361 ACTCTTCCCT AAATAATCCT TAAAAACTCC ATTTCCACCC CTCCCAGTTC CCAACTATTT 
6421 TGTCCGCCCA CAGCGGGGCA TTTTTCTTCC TGTTATGTTT TTAATCAAAC ATCCTGCCAA 
64 81 CTCCATGTGA CAAACCGTCA TCTTCGGCTA CTTTTTCTCT GTCACAGAAT GAAAATTTTT 
6 541 CTGTCATCTC TTCGTTATTA ATGTTTGTAA TTGACTGAAT ATCAACGCTT ATTTG C AG C C 
6 6 01 TGAATGGCGA ATG 
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ggtgacgccg tcatctttcc attgtaacgt aaatggcaac ttgtagatga acgcgctgtc 
ccactgcggc agtagaaagg taacattgca tttaccgttg aacatctact tgcgcgacag 



61 



aaaaaaccgg ccagtttctt ccacaaactc 
ttttttggcc ggtcaaagaa ggtgtttgag 



121 ^gcaacaatcg cgatgacctc gtggtatgga 
j f cattqttaqc gctactggag caccatacct 



gcgcacggct 
cgcgtgccga 



aattttttct 
_ttaaaaaaga 



gtctcgtaaa cttttgcgtc 
cagagcattt gaaaacgcag 

?J±£ll -// 

aaaaaagtgt cgttcatgtc 
tttt ttca ca gcaagtacag^. 



181 



^ ggcggcggcg ttcgcgctcc g gtacgcgcg 
^ ccgccgccgc aagcgcgagg q catgcgcgc 



241 



ctcgattatc ataaacaatc ctgcaggcat 
gagctaatag tatttgttag gacgtccgta 



acgggcacac 
tgcccgtgtg 

gcaagctgga 
cgttcgacct 



agcaggacag ccttgtccgg 
tcgtc ct^tCj ggaacaggcc 

tcatcacaag ttt jgtacaaaT 
ag tag tgttc aaacat 
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pDEST19 6668 bp (rotated to position 1000) 



Location (Base Nos.) 
515 . .391 



Gene Encoded 



765 . . 1424 
1544 . . 1628 
1766 . .2071 
2112 . .2236 
2852 . .2895 
3344 . .4319 
4460 . .5114 
5608 . .52 
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attR2 
lacZ 
ampR 



on 



genR 



1 AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG CGAGCATCGT TTGTTCGCCC AGGACTCTAG 
61 CTATAGTTCT AGTGGTTGGC TAC GTATATC AAATACTTGT AGGTGACGCC GTCATCTTTC 
121 CATTGTAACG TAAATGGCAA CTTGTAGATG AACGCGCTGT CAAAAAACCG GCCAGTTTCT 
181 TCCACAAACT CGCGCACGGC TGTCTCGTAA ACTTTTGCGT CGCAACAATC GCGATGACCT 
241 CGTGGTATGG AAATTTTTTC TAAAAAAGTG TCGTTCATGT CGGCGGCGGG CGCGTTCGCG 
3 01 CTCCGGTACG CGCGACGGGC ACACAGCAGG AC AGC CTTGT CCGGCTCGAT TATCATAAAC 
361 AATCCTGCAG GCATGCAAGC TCGGATCATC ACAAGTTTGT ACAAAAAAGC TGAACGAGAA 
421 AC GT AAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA ACAGACTACA 
481 TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCTAAGTTG GC AG CATC AC 
541 CCGACGCACT TTGCGCCGAA TAAATACCTG TGACGGAAGA TCACTTCGCA GAATAAATAA 
6 01 ATCCTGGTGT CCCTGTTGAT ACCGGGAAGC CCTGGGCCAA CTTTTGGCGA AAATGAGACG 
661 TTGATCGGCA CGTAAGAGGT TCCAACTTTC AC C ATAATGA AATAAGATCA CTACCGGGCG 
721 TATT TTTTG A GTTATCGAGA TTTTCAGGAG CTAAGGAAGC TAAAATGGAG AAAAAAATCA 
781 CTGGATATAC C AC CGTTG AT ATATCCCAAT GGCATCGTAA AGAACATTTT GAGGCATTTC 
841 AGTCAGTTGC TCAATGTACC TATAACCAGA CCGTTCAGCT GGATATTACG GCCTTTTTAA 
901 AGAC CGTAAA GAAAAATAAG CACAAGTTTT ATCCGGCCTT TATTCACATT CTTGCCCGCC 
961 TGATGAATGC TCATCCGGAA TTCCGTATGG CAATGAAAGA CGGTGAGCTG GTGATATGGG 
1021 ATAGTGTTCA CCCTTGTTAC ACCGTTTTCC ATGAG C AAAC TGAAACGTTT TCATCGCTCT 
1081 GGAGTGAATA C C AC G AC GAT TTCCGGCAGT TTCTACACAT ATATTCGCAA GATGTGGCGT 
1141 GTTACGGTGA AAACCTGGCC TATTTCCCTA AAGGGTTTAT TGAGAATATG TTTTTCGTCT 
12 01 CAGCCAATCC CTGGGTGAGT TTCACCAGTT TTGATTTAAA CGTGGCCAAT ATGGACAACT 

12 61 TCTTCGCCCC CGTTTTCACC ATGGGCAAAT ATTATACGCA AGGCGACAAG GTGCTGATGC 

13 21 CGCTGGCGAT TCAGGTTCAT CATGCCGTCT GTGATGGCTT CCATGTCGGC AGAATGCTTA 
13 81 ATGAATTACA ACAGTACTGC GATGAGTGGC AGGGCGGGGC GTAAACGCGT GGATCCGGCT 
1441 TACTAAAAGC CAGATAACAG TATG C GTATT TGCGCGCTGA TTTTTGCGGT ATAAGAATAT 

15 01 ATACTGATAT GTATACCCGA AGTATGTCAA AAAGAGGTGT GCTATGAAGC AGCGTATTAC 
1561 AGTGACAGTT GACAGCGACA GCTATCAGTT GCTCAAGGCA TATATGATGT CAATATCTCC 

16 21 GGTCTGGTAA G C AC AAC CAT GCAGAATGAA GCCCGTCGTC TGCGTGCCGA ACGCTGGAAA 
16 81 GCGGAAAATC AGGAAGGGAT GGCTGAGGTC GCCCGGTTTA TTGAAATGAA CGGCTCTTTT 
1741 GCTGACGAGA ACAGGGACTG GTGAAATGCA GTTTAAGGTT TACACCTATA AAAGAGAGAG 
18 01 CCGTTATCGT CTGTTTGTGG ATGTACAGAG TGATATTATT GACACGCCCG GGCGACGGAT 

,18 61 GGTGATCCCC CTGGCCAGTG CACGTCTGCT GT C AG AT AAA GTCTCCCGTG AACTTTACCC 
1921 GGTGGTGCAT ATCGGGGATG AAAGCTGGCG CATGATGACC AC CGATATG G CCAGTGTGCC 
1981 GGTCTCCGTT ATCGGGGAAG AAGTGGCTGA TCTCAGCCAC CGCG AAAATG ACATCAAAAA 
2 041 CGCCATTAAC CTGATGTTCT GGGGAATATA AATGTCAGGC TCCCTTATAC ACAGCCAGTC 
2101- TGCAGGTCGA CCATAGTGAC TGGATATGTT GTGTTTTACA GTATTATGTA GTCTGTTTTT 
2161 TATGCAAAAT CTAATTTAAT ATATTGATAT TTATATCATT TTACGTTTCT CGTTCAGCTT 
2 221 TCTTGTACAA AGTGGTGATC GAGAAGTACT AGAGGATCAT AATCAGCCAT AC C AC ATTTG 
22 81 TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 
2341 TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 
24 01 ATAGCATCAC AAATTT C AC A AATAAAGCAT TTTTTT C AC T GCATTCTAGT TGTGGTTTGT 
24 61 C C AAAC T CAT CAATGTATCT TATCATGTCT GGATCTGATC ACTGCTTGAG CCTAGGAGAT 
2 521 C C G AAC C AG A TAAGTGAAAT CTAGTTCCAA ACTATTTTGT CATTTTTAAT TTTCGTATTA 
2 581 GCTTACGACG CTACACCCAG TTCCCATCTA TTTTGTCACT CTTCCCTAAA TAATC CTTAA - 



2 641 AAACTCCATT TCCACCCCTC CCAGTTCCCA 
2701 TTCTTCCTGT TATGTTTTTA ATCAAACATC 
2761 TCGGCTACTT TTTCTCTGTC ACAGAATGAA 
2 821 TTTGTAATTG ACTGAATATC AACGCTTATT 
28 81 GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 

2 941 CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT 

3 001 GCTTTCCCCG TCAAGCTCTA AATCGGGGGC 
3 061 GGCACCTCGA CCCCAAAAAA CTTGATTAGG 
3121 GAT AG AC G GT TTTTCGCCCT TTGACGTTGG 
3181 TCCAAACTGG AACAACACTC AACCCTATCT 
3241 TGCCGATTTC GGCCTATTGG TTAAAAAATG 
3 3 01 TTAACAAAAT ATTAACGTTT' ACAATTTCAG 
3361 CCCCTATTTG TTTATTTTTC TAAATACATT 
3421 CCTGATAAAT GCTTCAATAA TATTGAAAAA 
3481 TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 
3 541 TGGTGAAAGT AAAAGATGCT GAAGATCAGT 
3 6 01 ATCTCAACAG CGGTAAGATC CTTGAGAGTT 
3661 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 
3 721 AACTCGGTCG CCGCATACAC TATTCTCAGA 
3781 AAAAG CAT CT TACGGATGGC ATGACAGTAA 
3 841 GTGATAACAC TGCGGCCAAC TTACTTCTGA 
3 9 01 CTTTTTTGCA CAACATGGGG GATCATGTAA 

3 961 ATG AAG C CAT AC CAAACGAC GAGCGTGACA 

4 021 TGCGCAAACT ATTAACTGGC GAACTACTTA 
4 081 GGATGGAGGC GGATAAAGTT G C AG G AC C AC 
4141 TTATTGCTGA TAAATCTGGA GCCGGTGAGC 
4 2 01 GGCCAGATGG TAAGCCCTCC CGTATCGTAG 
42 61 TGGATGAACG AAATAGACAG ATCG CTGAGA 
4 321 TGTCAGACCA AGTTTACTCA TATATACTTT 
4 381 AAAGGATCTA GGTGAAGATC CTTTTTGATA 
4 441 TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 
4 5 01 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 
4 561 GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 
4 621 AG AT AC C AAA TACTGTCCTT CTAGTGTAGC 
4 681 TAGCACCGCC TACATACCTC GCTCTGCTAA 
4 741 ATAAGTCGTG TCTTACCGGG TTGGACTCAA 
4 8 01 CGGGCTGAAC GGGGGGTTCG TGCACACAGC 
4 861 TGAGATACCT ACAGCGTGAG CATTGAGAAA 
4 921 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 

4 981 GAAACGCCTG GTATCTTTAT AGTC CTGTCG 

5 041 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 
5101 TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 
5161 ATTCTGTGGA TAACCGTATT ACCGCCTTTG 
52 21 CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 
52 81 TCCTTACGCA TCTGTGCGGT ATTTCACACC 

's'341 GTTACGGTTG AGTAATAAAT GGATGCCCTG 
54 01 TTAAACTGAA CAAAATAGAT CTAAACTATG 
54 61 TGTAAACTGA AATCAGTCCA GTTATGCTGT 
5 521vAAAGCAAACT CTTCATTTTC TGAAGTGCAA 
5 581 CAAGGGCATG GTAAAGACTA TATTCGCGGC 
5641 CCGGGAAGCC GATCTCGGCT TGAACGAATT 
57 01 AGTGCATCAC TTCTTCCCGT ATGCCCAACT 

57 61 CGTAATCTGC TTGCACGTAG ATCACATAAG 
5821 GATTGATGAG CGCGGTGGCA ATGCCCTGCC 

58 81 AGATATAGAT CTCACTACGC GGCTGCTCAA 

5 941 CAACAACCGC TTCTTGGTCG AAGGCAGCAA 

6 001 TCCCGAGGTA ATCGGAGTCC GGCTGATGTT 
6 061 GAC CGAAAAG ATCAAGAGCA GCCCGCATGG 



ACTATTTTGT CCGCCCACAG CGGGGCATTT 
CTGCCAACTC CATGTGACAA ACCGTCATCT 
AATTTTTCTG TCATCTCTTC GTTATTAATG 
TGCAGCCTGA ATGGCGAATG GACGCGCCCT 
TGGTTACGCG CAGCGTGACC GCTACACTTG 
TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 
TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 
GTGATGGTTC ACGTAGTGGG CCATCGCCCT 
AGTCCACGTT CTTTAATAGT GGACTCTTGT 
CGGTCTATTC TTTTGATTTA TAAGGGATTT 
AGCTGATTTA ACAAAAATTT AACGCGAATT 
GTGGCACTTT TCGGGGAAAT GTGCGCGGAA 
CAAATATGTA TCCGCTCATG AGACAATAAC 
GGAAGAGTAT GAGTATTCAA CATTTCCGTG 
GCCTTCCTGT TTTTGCTCAC CCAGAAACGC 
TGGGTGCACG AGTGGGTTAC ATCGAACTGG 
TTCGCCCCGA AGAACGTTTT CCAATGATGA 
TATTATCCCG TATTGACGCC GGGCAAGAGC 
ATGACTTGGT TGAGTACTCA CCAGTCACAG 
GAGAATTATG CAGTGCTGCC ATAAC CATGA 
CAACGATCGG AGG AC CGAAG GAGCTAACCG 
CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA 
CCACGATGCC TGTAGCAATG GCAACAACGT 
CTCTAGCTTC CCGGCAACAA TTAATAGACT 
TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT 
GTGGGTCTCG CGGTATCATT GCAGCACTGG 
TTATC TAC AC GACGGGGAGT CAGGCAACTA 
TAGGTGCCTC ACTGATTAAG CATTGGTAAC 
AGATTGATTT AAAACTTCAT TTTTAATTTA 
ATCTCATGAC CAAAATCCCT TAACGTGAGT 
AAAAG AT C AA AGGATCTTCT TGAGATCCTT 
CAAAAAAACC ACCGCTACCA GCGGTGGTTT 
TTCCGAAGGT AACTGGCTTC AGCAGAGCGC 
CGTAGTTAGG CCACCACTTC AAGAACTCTG 
TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 
GAC G ATAGTT AC CGG ATAAG GCGCAGCGGT 
CCAGCTTGGA GCGAACGACC TACACCGAAC 
GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 
CAGGAGAGCG CACGAGGGAG CTTCCAGGGG 
GGTTTCGCCA CCTCTGACTT GAG C GT C GAT 
TATGGAAAAA CGCCAGCAAC GCGGCCTTTT 
CTCACATGTT CTTTCCTGCG TTATC CCCTG 
AGTGAGCTGA TACCGCTCGC CGCAGCCGAA 
AAGCGGAAGA GCGCCTGATG CGGTATTTTC 
GCAGACCAGC CGCGTAACCT GGCAAAATCG 
CGTAAGCGGG TGTGGGCGGA CAATAAAGTC 
ACAATAAAGT CTTAAACTAG ACAGAATAGT 
G AAAAAG CAT ACTGGACTTT TGTTATGGCT 
ATTGCCCGTC GTATTAAAGA GGGGCGTGGC 
GTTGTGACAA TTTACCGAAC AACTCCGCGG 
GTTAGGTGGC GGTACTTGGG TCGATATCAA 
TTGTATAGAG AGCCACTGCG GGATCGTCAC 
CACCAAGCGC GTTGGCCTCA TGCTTGAGGA 
TCCGGTGCTC GCCGGAGACT G C GAG AT CAT 
ACCTGGGCAG AACGTAAGCC GCGAGAGCGC 
GCGCGATGAA TGTCTTACTA CGGAGCAAGT 
GGGAGTAGGT GGCTACGTCT CCGAACTCAC 
ATTTGACTTG GTCAGGGCCG AGCCTACATG - 



6121 TGCGAATGAT GCCCATACTT GAGCCACCTA ACTTTGTTTT AGGGCGACTG CCCTGCTGCG 

6181 TAACATCGTT GCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCAAACA TCGACCCACG 

6241 GCGTAACGCG CTTGCTGCTT GGATGCCCGA GGCATAGACT GTACAAAAAA ACAGTCATAA 

6 3 01 CAAGCCATGA AAACCGCCAC TGCGCCGTTA CCACCGCTGC GTTCGGTCAA GGTTCTGGAC 

6361 CAGTTGCGTG AGCGCATACG CTACTTGCAT TACAGTTTAC GAACCGAACA GGCTTATGTC 

6421 AACTGGGTTC GTGCCTTCAT CCGTTTCCAC GGTGTGCGTC ACCCGGCAAC CTTGGGCAGC 

64 81 AGCGAAGTCG AGGCATTTCT GTCCTGGCTG GCGAACGAGC GCAAGGTTTC GGTCTCCACG 

6541 CATCGTCAGG CATTGGCGGC CTTGCTGTTC TTCTACGGCA AGGTGCTGTG CACGGATCTG 

66 01 CCCTGGCTTC AGGAGATCGG AAGACCTCGG CCGTCGCGGC GCTTGCCGGT GGTGCTGACC 
6661 CCGGATGA 




2 581 GAGATTTAAA GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTACTG 
2 641 ATTCTAATTG TTTGTGTATT TTAGATTC C A ACCTATGGAA CTGATGAATG GGAGCAGTGG 
2701 TGGAATGCCT TTAATGAGGA AAACCTGTTT TGCTCAGAAG AAATGCCATC TAGTGATGAT 
2761 GAGGCTACTG CTGACTCTCA ACATTCTACT CCTCCAAAAA AGAAGAGAAA GGTAGAAGAC 
2821 CCCAAGGACT TTCCTTCAGA ATTGCTAAGT TTTTTGAGTC ATGCTGTGTT TAGTAATAGA 
2881 ACTCTTGCTT GCTTTGCTAT TTAC AC C AC A AAGGAAAAAG CTGCACTGCT ATACAAGAAA 

2 941 ATTATGGAAA AATATTCTGT AACCTTTATA AGTAGGCATA ACAGTTATAA TCATAACATA 

3 0 01 CTGTTTTTTC TTACTCCACA CAGGCATAGA GTGTCTGCTA TTAATAACTA TGCTCAAAAA 
3 061 TTGTGTACCT TTAGCTTTTT AATTTGTAAA GGGGTTAATA AGGAATATTT GATGTATAGT 
3121 GCCTTGACTA GAGATCATAA TCAGCCATAC CACATTTGTA GAGGTTTTAC TTGCTTTAAA 
3181 AAACCTCCCA CACCTCCCCC TGAACCTGAA ACATAAAATG AATGCAATTG TTGTTGTTAA 
3241 CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA 

33 01 TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC AAACTCATCA ATGTATCTTA 
3361 TCATGTCTGG ATCCCCAGGA AGCTCCTCTG TGTCCTCATA AACCCTAACC TCCTCTACTT 
3421 GAGAGGACAT TCCAATCATA GGCTGCCCAT CCACCCTCTG TGTCCTCCTG TTAATTAGGT 

34 81 CACTTAACAA AAAGGAAATT GGGTAGGGGT TTTTCACAGA CCGCTTTCTA AGGGTAATTT 
3 541 TAAAATATCT GGGAAGTCCC TTCCACTGCT GTGTTCCAGA AGTGTTGGTA AACAGCCCAC 
3 6 01 AAATGTCAAC AGCAGAAACA TACAAGCTGT CAGCTTTGCA CAAGGGCCCA ACACCCTGCT 
3661 CATCAAGAAG CACTGTGGTT GCTGTGTTAG TAATGTGCAA AACAGGAGGC AC ATTTT C C C 
3721 CACCTGTGTA GGTTCCAAAA TATCTAGTGT TTTCATTTTT ACTTGGATCA GGAAC C C AG C 
3 781 ACTCCACTGG ATAAGCATTA TCCTTATCCA AAACAGCCTT GTGGTCAGTG TTCATCTGCT 
3841 GACTGTCAAC TGTAGCATTT TTTGGGGTTA C AGTTTG AG C AGGATATTTG GTCCTGTAGT 
3 9 01 TTGCTAACAC ACCCTGCAGC TCCAAAGGTT CCCCACCAAC AGCAAAAAAA TGAAAATTTG 

3 961 ACCCTTGAAT GGGTTTTCCA GCACCATTTT CATGAGTTTT TTGTGTCCCT GAATGCAAGT 

4 021 TTAACATAGC AGTTACCCCA ATAAC CTC AG TTTTAACAGT AACAGCTTCC CACATCAAAA 
4 081 TATTTC C AC A GGTTAAGTCC TCATTTAAAT TAGGCAAAGG AATTGCTCTA GAGCGGCCGC 
4141 CACCGCGGTG GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACGCGCGC TCACTGGCCG 
42 01 TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG 
4261 CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC 
4 321 AACAGTTGCG CAGCCTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG 
4381 CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC 
44 41 CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA 
4 5 01 ATCGGGGGCT CCCTTTAGGG TTC CGATTT A GTGCTTTACG GCACCTCGAC CCCAAAAAAC 
4 561 TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT 
4 621 TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA ACAACACTCA 
4 681 ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT 
4 741 TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGCTTA 
4 8 01 CAATTTAGGT GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA 
4861 AATAC ATT C A AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC TTCAATAATA 
4 921 TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC 

4 981 GGCATTTTGC CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAGATGCTGA 

5 041 AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTC AAC AG C G GTAAGATCCT 
5101 TGAGAGTTTT CGCCCCGAAG AACGTTTT C C AATGATGAGC ACTTTTAAAG TTCTG CTATG 
5161 TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA 

^221 TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA CGGATGGCAT 
5'2 81 GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT 
5341 ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA 

54 01 TCATGTAACT CGCCTTGATC GTTGGG AAC C GGAGCTGAAT GAAGCCATAC CAAACGACGA 
5 4 6 1 1 GCGTGACACC ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT TAACTGGCGA 
5521 ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG ATAAAGTTGC 

55 81 AGGACCACTT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA AATCTGGAGC 
5641 CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG CCAGATGGTA AGCCCTCCCG 
5701 TATCGTAGTT ATCTACACGA CGGGGAGTCA GGCAACTATG GATGAAC GAA ATAGACAGAT 
5761 CGCTGAGATA GGTGCCTCAC TGATTAAG C A TTGGTAACTG TCAGACCAAG TTTACT C ATA 
5821 TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT 
58 81 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTC CACT GAG CGT C AG A 

5 941 CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG TAATCTGCTG 

6 001 CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCCGGATC AAGAGCTACC- 



6 061 AACTCTTTTT CCGAAGGTAA CTGGCTTCAG C AG AG C G C AG ATACCAAATA CTGTCCTTCT 
6121 AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC 
6181 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC TTACCGGGTT 
6241 GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG GGGGTTCGTG 

63 01 CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG AG ATAC CTAC AGCGTGAGCT 
6361 ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCAG 
6421 GGTCGGAACA GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG 

64 81 TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT CGTCAGGGGG 
6 541 GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG CCTTTTGCTG 
6 6 01 GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA ACCGTATTAC 
6661 CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA GCGAGTCAGT 
6721 GAG CG AGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT 
6 781 TCATTAATGC AGCTGGCACG AC AGGTTTC C CGACTGGAAA GCGGGCAGTG AGCGCAACGC 
6 841 AATTAATGTG AGTTAGCTCA CTCATTAGGC ACCCCAGGCT TTACACTTTA TGCTTCCGGC 
6 9 01 TCGTATGTTG TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA GCTATGACCA 
6 961 TGATTACGCC AAGCGCGCAA TTAACCCTCA CTAAAGGGAA CAAAAGCTGG GTACCGGGCC 
7021 CCCCCT 




Figure 3^4: pDEST12.2 CMV Promoter for Eukaryotic 
Expression, SV40 Promoter/ori for G418 Resistance 



307 
3S8 
409 
460 
511 



acc 
tgg 

gac 
ctg 

taa 

att 

tat 
ata 



jr+mKMAr -from 
grc aga teg cct 
cag tct age gga 

aec ggg acc gat 
tgg ccc tgg eta 

eaa ttt cac aca 
gtt aaa gtg tgt 

tta ggt gac act 
aat cca ctg tga 



gga gAc gec ate cac get gtt ttg acc tec ata gaa 
cct ctg egg tag gtg cga caa aac tgg agg tat ctt 

cca gec tec gga etc tag cct agg ccg egg age gga 
Wt egg agg cct gag ate gga tec ggc gee teg cct 

gga aac age tat gac cat tag gee ttt gca aaa age 
cct ttg teg ata ctg gta ate egg aaa cgt ttt teg 

ata gaa ggt acg cct gca ggt a ge qqt ccg gkattc 
tat ctt cca tgc gga cgt cca tgg cck ggc ctt aab 




Eam1105 I 6257 



Nut I 1 

SspLLM 1 I 7150 I SnaB I 292 

\ I Bbs 1 581 

-J 1 / - Sst II 623 

Sse8387!713 

Not I 855 



Xmn I 5658 



Nsp V 4967 
Rsr II 4801 




EcoR I 1176 
PfM I 1402 



Mlu\ 1623 
Ssfl 1 07 I 1709 



S/7I2044 
Sma I 2045 
x Ava I 2045 
Sal I 2304 



Nhe I 2533 



Tth\ 1 1 I 4401 

Nar I 4285 
Af\ II 4140 
Hind 111 4108 
BseR I 4073 

Sff I 4038 



Be/ 1 2809 

Mun I 2902 
Hpa I 2913 



Dra III 3404 



SexA I 3859 



pDEST12.2 7278 bp (rotated to position 3900) 
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Gene Encoded 
ori 

CMV promoter 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

small t & polyA 
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neo 

ampR 



1 GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT 
61 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT 
121 ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 
181 TCAGTGAGCG AGGAAGCGGA AGAGCTCGCG AATGCATGTC GTTACATAAC TTACGGTAAA 
241 TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT 
3 01 TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA 
3 61 AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT 
421 CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGAC CTT AT GGGACTTTCC 
481 TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA 
541 GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT 
6 01 TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA 
661 CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG 
721 CAGAGCTCGT TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT 
781 CCATAGAAGA CACCGGGACC GATCCAGCCT CCGGACTCTA GCCTAGGCCG CGGGACGGAT 
841 AACAATTTCA CACAGGAAAC AGC TATGACC ATTAGGCCTT TGCAAAAAGC TATTTAGGTG 
901 ACACTATAGA AGGTACGCCT GCAGGTACCG GAT C AC AAGT TTGTACAAAA AAGCTGAACG 
961 AGAAACGTAA AATGATATAA ATATCAATAT ATTAAATTAG ATTTTGCATA AAAAACAGAC 
1021 TACATAATAC TGTAAAACAC AACATATCCA GTCACTATGG CGGCCGCATT AGGCACCCCA 
1081 GGCTTTACAC TTTATGCTTC CGGCTCGTAT AATGTGTGGA TTTTGAGTTA GGATCCGTCG 
1141 AGATTTTCAG GAGCTAAGGA AGCTAAAATG GAGAAAAAAA TCACTGGATA TACCACCGTT 
12 01 GATATATCCC AATGGCATCG TAAAGAACAT TTTGAGGCAT TTCAGTCAGT TGCTCAATGT 

12 61 ACCTATAACC AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGAC CGT AAAGAAAAAT 

13 21 AAGCACAAGT TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG 
1381 GAATTCCGTA TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT 
1441 TACACCGTTT TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA AT AC CACGAC 
1501 GATTTCCGGC AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TGAAAACCTG 
1561 GCCTATTTCC CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG 
16 21 AGTTTCACCA GTTTTGATTT AAACGTGGCC AATATGGACA ACTTCTTCGC CCCCGTTTTC 
16 81 ACCATGGGCA AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT 
1741 CATCATGCCG TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC 
18 01 TGCGATGAGT GGCAGGGCGG GGCGTAAACG CGTGGATCCG GCTTACTAAA AGCCAGATAA 
18 61 CAGTATGCGT ATTTGCGCGC TGATTTTTGC GGTATAAGAA TAT AT AC TG A T ATGT AT AC C 
1921 .CGAAGTATGT CAAAAAGAGG TGTGCTATGA AGCAGCGTAT TACAGTGACA GTTGACAGCG 
1981 ACAGCTATCA GTTGCTCAAG GCATATATGA TGTCAATATC TCCGGTCTGG TAAGCACAAC 
2 041 CATGCAGAAT GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAGCGGAAA ATCAGGAAGG 
2101 GATGGCTGAG GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA 
2161 CTGGTGAAAT GCAGTTTAAG GTTTAC AC CT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG 
2 221 TGGATGTACA GAGTGATATT ATTGACACGC CCGGGCGACG GATGGTGATC CCCCTGGCCA 
2281 GTGCACGTCT GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CAT AT C GGGG 
2 3 41 ATGAAAGCTG GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTT AT CGGGG 
24 01 AAGAAGTGGC TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT - 



24 61 TCTGGGGAAT ATAAATGTCA GGCTCCCTTA TACACAGCCA GTCTGCAGGT CGACCATAGT 

2 521 GACTGGATAT GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT 

2581 AATATATTGA TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTG 

2641 ATCGCGTGCA TGCGACGTCA TAGCTCTCTC CCTATAGTGA GTCGTATTAT AAGCTAGGCA 

27 01 CTGGCCGTCG TTTTACAACG TCGTGACTGG GAAAACTGCT AGCTTGGGAT CTTTGTGAAG 

2761 GAACCTTACT TCTGTGGTGT GACATAATTG GACAAACTAC CTACAGAGAT TTAAAGCTCT 

2 821 AAGGTAAATA TAAAATTTTT AAGTGTATAA TGTGTTAAAC TAG CTG CAT A TGCTTGCTGC 

2 881 TTGAGAGTTT TGCTTACTGA GTATGATTTA TGAAAATATT AT AC AC AG G A GCTAGTGATT 

2 941 CTAATTGTTT GTGTATTTTA GATTCACAGT CCCAAGGCTC ATTTC AGG C C CCTCAGTCCT 

3 001 CACAGTCTGT TC ATGAT CAT AATCAGCCAT AC C AC ATTTG TAGAGGTTTT ACTTGCTTTA 
3 061 AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT 
3121 AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAG C A ATAGCATCAC AAATTTCACA 
3181 AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT C C AAACTC AT CAATGTATCT 
3241 TATCATGTCT GGATCGATCC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT 

33 01 GCGTATTGGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT CGCCCTTCCC AACAGTTGCG 
3361 C AG C CTGAAT GGCGAATGGG ACGCGCCCTG TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT 
3421 GGTTACGCGC AGCGTGACCG CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT 

34 81 CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 
3541 CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG 

36 01 TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT TG AC GTTGGA 
3661 GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA AC AAC AC T C A ACCCTATCTC 
3721 GGTCTATTCT TTTGATTTAT AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA 

37 81 GCTGATTTAA CAAATATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTCGCC 
3 841 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA CGCGGATCTG 
3 901 CGCAGCACCA TGGCCTGAAA TAACCTCTGA AAGAGGAACT TGGTTAGGTA CCTTCTGAGG 

3 961 CGGAAAGAAC CAGCTGTGGA ATGTGTGTCA GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC 

4 021 AGCAGGCAGA AGTATGCAAA GCATGCATCT CAATTAGTCA GCAACCAGGT GTGGAAAGTC 
4 081 CCCAGGCTCC C C AG C AG G C A GAAGTATG C A AAG C ATG CAT CTCAATTAGT CAGCAACCAT 
4141 AGTCCCGCCC CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC 
42 01 GCCCCATGGC TGACTAATTT TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA 
4261 GCTATTCCAG AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGA 
4 321 TTCTTCTGAC ACAACAGTCT CGAACTTAAG G C TAG AG C C A CCATGATTGA ACAAGATGGA 
4381 TTGCACGCAG GTTCTCCGGC CGCTTGGGTG GAGAGGCTAT TCGGCTATGA CTGGGCACAA 
4441 CAGACAATCG GCTGCTCTGA TGCCGCCGTG TTCCGGCTGT CAGCGCAGGG GCGCCCGGTT 

45 01 CTTTTTGTCA AGACCGACCT GTCCGGTGCC CTGAATGAAC TGCAGGACGA GGCAGCGCGG 
4 561 CTATCGTGGC TGGCCACGAC GGGCGTTCCT TGCGCAGCTG TGCTCGACGT TGTCACTGAA 
4 621 GCGGGAAGGG ACTGGCTGCT ATTGGGCGAA GTGCCGGGGC AGGATCTCCT GTCATCTCAC 

46 81 CTTGCTCCTG CCGAGAAAGT ATC CATC ATG GCTGATGCAA TGCGGCGGCT GCATACGCTT 
4 741 GATCCGGCTA CCTGCCCATT CGACCACCAA GCGAAACATC GCATCGAGCG AGCACGTACT 
4 8 01 CGGATGGAAG CCGGTCTTGT CGATCAGGAT GATCTGGACG AAG AG CAT C A GGGGCTCGCG 
48 61 CCAGCCGAAC TGTTCG C C AG GCTCAAGGCG CGCATGCCCG ACGGCGAGGA TCTCGTCGTG 
4 921 ACCCATGGCG ATGCCTGCTT GCCGAATATC ATGGTGGAAA ATGGCCGCTT TTCTGGATTC 

4 981 ATCGACTGTG GCCGGCTGGG TGTGGCGGAC CGCTATCAGG ACATAGCGTT GGCTACCCGT 

5 041 GATATTGCTG AAGAGCTTGG CGGCGAATGG GCTGACCGCT TCCTCGTGCT TTACGGTATC 
5101 GCCGCTCCCG ATTCGCAGCG CATCGCCTTC TATCGCCTTC TTGACGAGTT CTTCTGAGCG 
5161 GGACTCTGGG GTTCGAAATG ACCGACCAAG CGACGCCCAA CCTGCCATCA CGATGGCCGC 
52 21 AATAAAATAT CTTTATTTTC ATTACATCTG TGTGTTGGTT TTTTGTGTGA ATCGATAGCG 
52 81 ATAAGGATCC GCGTATGGTG CACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC 
5341; .CAGCCCCGAC ACCCGCCAAC ACCCGCTGAC GCGCCCTGAC GGGCTTGTCT GCTCCCGGCA 
54 01 TCCGCTTACA GACAAGCTGT GACCGTCTCC GGGAGCTGCA TGTGTCAGAG GTTTTCACCG 

54 61 TCATCACCGA AACGCGCGAG ACGAAAGGGC CTCGTGATAC GCCTATTTTT ATAGGTTAAT 
5 521 GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 

55 81 ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA 
5641 C C C TG AT AAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TG AGTATT C A ACATTTCCGT 
57 01 GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG 
57 61 CTGGTGAAAG TAAAAGATGC TG AAG ATC AG TTGGGTGCAC GAGTGGGTTA CATCGAACTG 
5821 GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG 
5881 AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATC C C GTATTGACGC CGGGCAAGAG - 




R$o& L tObit p^ST20 Glutathione-S-transf erase Fusion with 
Polyhedron Promoter for Baculovirus Expression 
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atg tt t^ttt cga ct t 



aat ate aat ata tta aat tag at 
tta tag tta tat aat tta ate ta 



Ul 



u ■ 



S~,-!f 



5 2 



Swat 6815 
A/spV6787 
Bsg I 6656 

EcoO109 1 6422 
EcoN I 6397. 
SnaB I 6232 
Bbs\ 6103 



Not \ 131 

PvuW 350 



Eco Rl 452 

PflM I 678 

Xba I 899 
BssH II 946 



7Y/)1 1 1 I 5578 



BseR I 5299 
EcoR V 5176 
SstW 5118 




S//I 1320 
Sma\ 1321 
Psfl 1574 
BspM I 1576 
Sa/I 1580 

Muni 1830 
Hpal 1841 

>4vrll 1994 



NgoA IV 2480 
San II 2510 




V 



pDEST20 7066 bp (rotated to position 5800) 



Location (Base Nos.) 
592 . . 1263 
1397 . . 1273 
1506 . .2165 
2285 . .2369 
2507 . .2812 
2853 . .2977 
4214 . .5064 
5263 . . 5843 



Gene Encoded 

GST 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

ampR 

ori 





1 


CCACTGCGCC 


GTTACCACCG 




61 


TACGCTACTT 


GCATTACAGT 




121 


TCATCCGTTT 


CCACGGTGTG 




181 


TTCTGTCCTG 


GCTGGCGAAC 




241 


CGGCCTTGCT 


GTTCTTCTAC 




301 


TCGGAAGACC 


TCGGCCGTCG 




361 


GCATCCTCGG 


TTTTCTGGAA 




421 


CTAGTGGTTG 


GCTACGTATA 


a 


481 


AACCATCTCG 


CAAATAAATA 




541 


CCTATAAATA 


TTCCGGATTA 


111 


601 


ATACTAGGTT 


ATTGGAAAAT 


3 ! 

p« 


661 


CTTGAAGAAA 


AATATGAAGA 


%.J 


721 


AAAAAGTTTG 


AATTGGGTTT 




781 


AAATTAACAC 


AGTCTATGGC 


01 


841 


GGTTGTCCAA 


AAGAGCGTGC 


«^ 


901 


TACGGTGTTT 


CGAGAATTGC 


i 5 5 


961 


AGCAAGCTAC 


CTGAAATGCT 




1021 


AATGGTGATC 


ATGTAACCCA 




1081 


TACATGGACC 


CAATGTGCCT 




1141 


GAAGCTATCC 


CACAAATTGA 




1201 


CAGGGCTGGC 


AAGCCACGTT 


Vs 3 


1261 


CGTCATAATC 


AAACAAGTTT 


3-. .3 


1321 


ATCAATATAT 


TAAATTAGAT 




1381 


CAT AT C CAGT 


CACTATGGCG 




1441 


GCTCGTATGT 


TGTGTGGATT 




1501 


CTAAAATGGA 


GAAAAAAATC 




1561 


AAGAACATTT 


TGAGGCATTT 




1621 


TGGATATTAC 


GGCCTTTTTA 




1681 


TTATT C AC AT 


TCTTGCCCGC 




1741 


ACGGTGAGCT 


GGTGATATGG 




1801 


CTGAAACGTT 


TTCATCGCTC 




1861 


TATATTCGCA 


AGATGTGGCG 




/1 ; 921 


TTGAGAATAT 


GTTTTTCGTC 




1981 


ACGTGGCCAA 


TATGGACAAC 




2041 


AAGGCGACAA 


GGTGCTGATG 




2101 


TCCATGTCGG 


CAGAATGCTT 




2161* 


CGTAATCTAG 


AGGATCCGGC 




2221 


ATTTTTGCGG 


TATAAGAATA 




2281 


TGCTATGAAG 


CAGCGTATTA 




2341 


ATATATGATG 


TCAATATCTC 




2401 


CTGCGTGCCG 


AACGCTGGAA 




2461 


ATTGAAATGA 


ACGGCTCTTT 




2521 


TT AC AC C TAT 


AAAAGAGAGA 




2581 


TGACACGCCC 


GGGCGACGGA 




2641 


AGTCTCCCGT 


GAACTTTACC 



CTGCGTTCGG TCAAGGTTCT GGACCAGTTG CGTGAGCGCA 
TTACGAACCG AACAGGCTTA TGTCAACTGG GTTCGTGCCT 
CGTCACCCGG CAACCTTGGG CAGCAGCGAA GTCGAGGCAT 
GAGCGCAAGG TTTCGGTCTC CACGCATCGT CAGGCATTGG 
GGCAAGGTGC TGTGCACGGA TCTGCCCTGG CTT CAGGAGA 
CGGCGCTTGC CGGTGGTGCT GACCCCGGAT GAAGTGGTTC 
GGCGAGCATC GTTTGTTCGC CCAGGACTCT AG CTATAGTT 
CTCCGGAATA TTAATAGATC ATGGAGATAA TTAAAATGAT 
AGTATTTTAC TGTTTTCGTA ACAGTTTTGT AATAAAAAAA 
TTCATACCGT CCCACCATCG GGCGCGGATC CATGGCCCCT 
TAAGGGCCTT GTGCAACCCA CTCGACTTCT TTTGGAATAT 
GCATTTGTAT GAGCGCGATG AAGGTGATAA ATGGCGAAAC 
GGAGTTTCCC AATCTTCCTT ATTATATTGA TGGTGATGTT 
CATCATACGT TATATAGCTG ACAAGCACAA CATGTTGGGT 
AG AG ATTT C A ATGCTTGAAG GAGCGGTTTT GGATATTAGA 
ATATAGTAAA G AC TTTG AAA CTCTCAAAGT TGATTTTCTT 
GAAAATGTTC GAAGATCGTT TATGTCATAA AACATATTTA 
TCCTGACTTC ATGTTGTATG ACGCTCTTGA TGTTGTTTTA 
GGATGCGTTC C C AAAATT AG TTTGTTTTAA AAAACGTATT 
TAAGTACTTG AAATCCAGCA AGTATATAGC ATGGCCTTTG 
TGGTGGTGGC GACCATCCTC CAAAATCGGA TCTGGTTCCG 
GTACAAAAAA GCTGAACGAG AAACGTAAAA TGATATAAAT 
TTTGCATAAA AAACAGACTA CATAATACTG TAAAACACAA 
GCCGCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG 
TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA GCTAAGGAAG 
ACTGGATATA CCACCGTTGA TATATCCCAA TGGCATCGTA 
CAGTCAGTTG CTCAATGTAC CTATAAC C AG ACCGTTCAGC 
AAG AC CGTAA AGAAAAATAA G C AC AAGTTT TATCCGGCCT 
CTGATGAATG CTCATCCGGA ATTCCGTATG GCAATGAAAG 
GATAGTGTTC AC C C TTGTTA CACCGTTTTC C ATGAG CAAA 
TGGAGTGAAT AC C ACGACG A TTTCCGGCAG TTTCTACACA 
TGTTACGGTG AAAACCTGGC CTATTTCCCT AAAGGGTTTA 
TCAGCCAATC CCTGGGTGAG TTTCACCAGT TTTGATTTAA 
TTCTTCGCCC CCGTTTTCAC CATGGGCAAA TATTATACGC 
CCGCTGGCGA TTCAGGTTCA TCATGCCGTC TGTGATGGCT 
AATGAATTAC AACAGTACTG CGATGAGTGG CAGGGCGGGG 
TTACTAAAAG CCAGATAACA GTATGCGTAT TTGCGCGCTG 
TATACTGATA TGTATACCCG AAGTATGTCA AAAAGAGGTG 
CAGTGACAGT TGACAGCGAC AGCTATCAGT TGCTCAAGGC 
CGGTCTGGTA AGCACAACCA TGCAGAATGA AGCCCGTCGT 
AGCGGAAAAT CAGGAAGGGA TGGCTGAGGT CGCCCGGTTT 
TGCTGACGAG AACAGGGACT GGTGAAATGC AGTTTAAGGT 
GCCGTTATCG TCTGTTTGTG GATGTACAGA GTGATATTAT 
TGGTGATCCC CCTGGCCAGT GCACGTCTGC TGTCAGATAA 
CGGTGGTGCA TATCGGGGAT GAAAGCTGGC GCATGATGAC - 



f)6,[/^ Ltd 





27 01 CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA GAAGTGG CTG ATCTCAGCCA 

27 61 CCGCGAAAAT GACATCAAAA ACGC CATTAA CCTGATGTTC TGGGGAATAT AAATGTCAGG 

28 21 CTC C CTTATA CACAGCCAGT CTGCAGGTCG AC C ATAGTGA CTGGATATGT TGTGTTTTAC 
28 81 AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA TTTATATCAT 

2 941 TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG ATAGCTTGTC GAGAAGTACT 

3 001 AGAGGATCAT AAT C AG C CAT AC C AC ATTTG TAGAGGTTTT ACTTGCTTTA AAAAACCTCC 
3 061 CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT AACT.TGTTTA 
3121 TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTT C AC A AAT AAAG CAT 
3181 TTTTTTCACT GCATTCTAGT TGTGGTTTGT C C AAAC T CAT CAATGTATCT TATCATGTCT 
3241 GGATCTGATC ACTGCTTGAG CCTAGGAGAT CCGAACCAGA TAAGTGAAAT CTAGTTCCAA 
3301 ACTATTTTGT CATTTTTAAT TTTCGTATTA GCTTACGACG CTACACCCAG TTCCCATCTA 
3361 TTTTGT C ACT CTTCCCTAAA TAATCCTTAA AAACTCCATT TCCACCCCTC CCAGTTCCCA 
3421 ACTATTTTGT CCGCCCACAG CGGGGCATTT TTCTTCCTGT TATGTTTTTA ATCAAACATC 
34 81 CTGCCAACTC CATGTGACAA AC CGTC ATCT TCGGCTACTT TTTCTCTGTC ACAGAATGAA 
3 541 AATTTTTCTG TCATCTCTTC GTTATTAATG TTTGTAATTG ACTGAATATC AACGCTTATT 
36 01 TGCAGCCTGA ATGGCGAATG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 
3661 TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT 
3 721 TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC 
3 781 TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG 
3841 GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG 
3 901 AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT 

3 961 CGGTCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG 

4 021 AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG 
4 081 GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT 
4141 CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 
4 2 01 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 
4261 GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 
4 321 TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT 
43 81 TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 
4441 TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA 
4 501 ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 
4 561 GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 
4 621 CAACGATCGG AGGAC CGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA 
4681 CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATG AAG C CAT ACCAAACGAC GAGCGTGACA 
4 741 CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA 
48 01 CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 
4861 TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 
4 921 GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG 

4 981 TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 

5 041 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT 
5101 AG ATT G ATTT AAAAC TT CAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA 
5161 ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 
5221 AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 
52 81 CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 
5341 TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AG AT AC C AAA TACTGTCCTT CTAGTGTAGC 



54 01 CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 
54 61 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 
5 521 GACGATAGTT AC C G GAT AAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TG C AC AC AG C 
5 581 CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA 
5 641 GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 
57 01 CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGC CTG GTATCTTTAT AGTCCTGTCG 

57 61 GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 
5 821 TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 

58 81 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 

5 941 AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 

6 001 AAG C G G AAG A GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC 
6 061 G C AG AC C AG C CGCGTAACCT GGCAAAATCG GTTACGGTTG AG T AAT AAAT GGATGCCCTG 
6121 CGTAAGCGGG TGTGGGCGGA CAATAAAGTC TTAAACTGAA CAAAATAGAT CTAAACTATG - 




6181 ACAATAAAGT CTTAAAC TAG ACAGAATAGT TGTAAACTGA AATCAGTCCA GTTATGCTGT 
6 241 G AAAAAG CAT ACTGGACTTT TGTTATGGCT AAAGCAAACT CTTCATTTTC TGAAGTGCAA 
6 3 01 ATTGCCCGTC GTATTAAAGA GGGGCGTGGC CAAGGGCATG GTAAAGACTA TATTCGCGGC 
6 3 61 GTTGTGACAA TTTACCGAAC AACTCCGCGG CCGGGAAGCC GATCTCGGCT TGAACGAATT 
6421 GTTAGGTGGC GGTACTTGGG TCGATATCAA AGTGCATCAC TTCTTCCCGT ATGCCCAACT 
64 81 TTGTATAGAG AGCCACTGCG GGATCGTCAC CGTAATCTGC TTGCACGTAG ATCACATAAG 
6 541 CACCAAGCGC GTTGGCCTCA TGCTTGAGGA GATTGATGAG CGCGGTGGCA ATGCCCTGCC 
6 601 TCCGGTGCTC GCCGGAGACT G CGAGATC AT AGATATAGAT CTCACTACGC GGCTGCTCAA 
6 661 ACCTGGGCAG AACGTAAGCC GCGAGAGCGC CAACAACCGC TTCTTGGTCG AAGGCAGCAA 
6721 GCGCGATGAA TGTCTTACTA CGGAGCAAGT TCCCGAGGTA ATCGGAGTCC GGCTGATGTT 
6 781 GGGAGTAGGT GGCTACGTCT CCGAACTCAC GAC CGAAAAG AT C AAG AG C A GCCCGCATGG 
6841 ATTTGACTTG GTCAGGGCCG AGCCTACATG TGCGAATGAT GCCCATACTT GAGCCACCTA 
6 9 01 ACTTTGTTTT AGGGCGACTG CCCTGCTGCG TAACATCGTT GCTGCTGCGT AACATCGTTG 

6 961 CTGCTCCATA ACATCAAACA TCGACCCACG GCGTAACGCG CTTGCTGCTT GGATGCCCGA 

7 021 GGCATAGACT GTACAAAAAA ACAGTCATAA CAAGCCATGA AAACCG 




Figure 4//) 





p^r 2 r 2-Hybrid Vector 

DNA-Binding Domain 



700 ^ttg. 9C9 



„aac qq 



9^tt t^gc/tat 
c gaa acq ata 



^aa gXa< ^aa at a gac ctg 



gacctgcaa tt 



tta tta fete 

6tt c*t att ta t *ctq oac qtt aat aat tag 



ttt -tgt^ 



751 A ttc etc gtc att gtt etc gtt ccc ttt ctt cct ta£ ttc ,J£tt- ttc fcge. aca'*' 
j.aaq' oaq ,cacr caa gag caa qqq aaa ga a _gga._aca ,^g_^, aaqlac.g, j;g t /, 



802 
853 

1261 

1312 



ata ttt caa get ata cca age ata caa tea act| cca age 



3 



ttg aag 
ttc 



h tat aaa qtt caa tAt got t eg tat gtt aat tqal gat teg aac 

<x«*t Tyz%«sU ri^ K ~ <> s (boJ4-D& 

tec tga aag |atg aag eta ctg tct tct ate gaa caa gca tgc gat 
agg act ttc tac ttc gat gac aga aga tag ctt gtt cgt acg eta 



caa gec 
gtt egg 

att tgc/f 
taa ac^ 

gaa gag agt agt aac aaa ggt caa aga cag ttg act gta tc\r 6cg a§*g tc*g 
ctt etc tea tea ttg ttt cca gtt tct gtc aac tga cat age age tec age 

N q_ . T S t- j Y K K A G &t fc l 

aat caa 
tta gtt 



aca agt tt jg tac aaa aaa get gaa cga gaa acg taa aat gat ata 
tgt tea aac atg tttl ttt cga ctt get ctt tgc att tta eta tat 



-a 

7/ 



\ \ 

AT. 



| v% 



.Pacl.40 



.Bsgl.795 
r.HindIII.837 



10293,BgM % 
9709,Mlul.. 



.Notl,1443 



m 



3 »!? 



8054,Sgfl 
7713,Nrul 



7381,Eco72l/ 
7356 t Swa! 




.Acclli.1901 
.Ncol.2206 
/ .Xbal.2352 
: /.BssHil f 2399 
'/ .BbvCI.2629 
/.Srfl,2773 
•*/Sma!,2774 
r.Xmai,2774 



Sall.3033 

t%Aatll,3186 
\Ed136ll f 3192 
Sad, 3 192 



Fspl.3863 
Dralll.4114 



6619,Tth111! 



\ BstEII.5092 
Clal.5209 
Xcml.5608 



pDEST21 11713 bp (rotated to position 11000) 



Location (Base Nos . ) 



857 
1456 . 
1706 . 
2485 . 
2707 . 
3053 . 
3716 . 
3899 . 
4414 . 
7541 . 
9668 . 
11118 



1322 
.1332 
.2365 
.2569 
.3012 
.3177 
.3735 
.4354 
.6642 
.8515 
. 10958 
.848 



Gene Encoded 

GAL4DB 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

pT7 (T7 promoter) 

fl (fl intergenic region) 

Leu2 

kanR 

CYH2 

pADH (ADH promoter) 



1 TTTATTATGT TACAATATGG AAGGGAACTT 
61 AAGTCCAATG C T AGT AG AG A AGGGGGGTAA 
121 CTAAACCGTG GAATATTTCG GATATC CTTT 
181 CTCTTTTCTG GCAACCAAAC CCATACATCG 
241 TAACATGTAG GTGGCGGAGG G GAG AT AT AC 
3 01 GGCTAAACAA GACTACACCA ATTACACTGC 
361 ACTGTAGCCC TAGACTTGAT AGC C AT CATC 
421 TGCCATCTAT TGAAGTAATA ATAGG C G CAT 
481 TCTCCCCCGT TGTTGTCTCA CCATATCCGC 
541 AGGAAAAAAT TAACGACAAA GACAGC AC C A 
6 01 GGTATCTTCG AACACACGAA ACTTTTTCCT 
661 AG C AACGGTA TACGGCCTTC CTTCCAGTTA 
721 TTTGCTATCA AGTATAAATA GACCTGCAAT 
781 TCGTTCCCTT TCTTCCTTGT TTCTTTTTCT 
841 AATCAACTCC AAGCTTGAAG CAAGCCTCCT 
901 AAGCATGCGA TATTTGCCGA CTTAAAAAGC 
961 CCAAGTGTCT GAAGAACAAC TGGGAGTGTC 
1021 TGACTAGGGC ACATCTGACA GAAGTGGAAT 
1081 TACTGATTTT TCCTCGAGAA G AC C TTG AC A 
1141 TAAAAG C ATT GTTAACAGGA TTATTTGTAC 

12 01 ATAGATTGGC TTCAGTGGAG ACTGATATGC 
1261 CGACATCATC ATC GGAAGAG AGTAGTAACA 
1321 GGTCGAATCA AACAAGTTTG TACAAAAAAG 

13 81 TCAATATATT AAATTAGATT TTGCATAAAA 
1441 ATATCCAGTC ACTATGGCGG CCGCTAAGTT 
1501 ATAAATACCT GTGACGGAAG ATCACTTCGC 
1561 TACCGGGAAG CCCTGGGCCA ACTTTTGGCG 
1621 TTCCAACTTT CACCATAATG AAATAAGATC 
,1681 ATTTTCAGGA GCTAAGGAAG CTAAAATGGA 
1*741 TATATCCCAA TGGCATCGTA AAGAACATTT 
18 01 CTATAAC C AG ACCGTTCAGC TGGATATTAC 
1861 GCACAAGTTT TATCCGGCCT TTATTCACAT 
1921* ATTCCGTATG GCAATGAAAG ACGGTGAGCT 
1981 CACCGTTTTC CATGAGCAAA CTGAAACGTT 
2 041 TTTCCGGCAG TTTCTACACA TATATTCGCA 
2101 CTATTTCCCT AAAGGGTTTA TTGAGAATAT 
2161 TTTCACCAGT TTTGATTTAA ACGTGGCCAA 
22 21 CATGGGCAAA TATTATACGC AAGGCGACAA 
22 81 TCATGCCGTC TGTGATGGCT TCCATGTCGG 
2 341 CGATGAGTGG CAGGGCGGGG CGTAATCTAG 
24 01 GTATGC GTAT TTGCGCGCTG ATTTTTG C GG 



TACACTTCTC CTATGCACAT ATATTAATTA 
CACCCCTCCG CGCTCTTTTC CGATTTTTTT 
TGTTGTTTCC GGGTGTACAA TATGGACTTC 
GGATTCCTAT AATACCTTCG TTGGTCTCCC 
AATAGAACAG ATACCAGACA AGACATAATG 
CTCATTGATG GTGGTACATA ACGAACTAAT 
ATATCGAAGT TTCACTACCC TTTTTCCATT 
GCAACTTCTT TTCTTTTTTT TTCTTTTCTC 
AATGACAAAA AAAATGATGG AAGACACTAA 
ACAGATGTCG TTGTTCCAGA GCTGATGAGG 
TCCTTCATTC ACGCACACTA CTCTCTAATG 
CTTGAATTTG AAATAAAAAA AGTTTGCCGC 
TATTAATCTT TTGTTTCCTC GTCATTGTTC 
GCACAATATT TCAAGCTATA CCAAGCATAC 
GAAAGATGAA GCTACTGTCT TCTATCGAAC 
TCAAGTGCTC CAAAGAAAAA CCGAAGTGCG 
GCTACTCTCC CAAAACCAAA AGGTCTCCGC 
CAAGGCTAGA AAGACTGGAA CAGCTATTTC 
TGATTTTGAA AATGGATTCT TTACAGGATA 
AAGATAATGT GAATAAAGAT GCCGTCACAG 
CTCTAACATT GAGACAGCAT AGAATAAGTG 
AAGGTCAAAG ACAGTTGACT GTATCGTCGA 
C TGAACGAGA AACGTAAAAT GATATAAATA 
AACAGACTAC ATAATACTGT AAAACACAAC 
GG C AG CATC A CCCGACGCAC TTTGCGCCGA 
AGAATAAATA AATCCTGGTG TCCCTGTTGA 
AAAATGAGAC GTTGATCGGC ACGTAAGAGG 
ACTACCGGGC GTATTTTTTG AGTTAT C GAG 
GAAAAAAATC ACTGGATATA CCACCGTTGA 
TGAGGCATTT CAGTCAGTTG CTCAATGTAC 
GGCCTTTTTA AAGAC CGTAA AGAAAAATAA 
TCTTGCCCGC CTGATGAATG CTCATCCGGA 
GGTGATATGG GATAGTGTTC ACCCTTGTTA 
TTCATCGCTC TGGAGTGAAT AC C AC G AC G A 
AGATGTGGCG TGTTACGGTG AAAACCTGGC 
GTTTTTCGTC TCAGCCAATC CCTGGGTGAG 
TATGGACAAC TTCTTCGCCC CCGTTTTCAC 
GGTGCTGATG CCGCTGGCGA TTCAGGTTCA 
CAGAATGCTT AATGAATTAC AACAGTACTG 
AGGATCCGGC TTACTAAAAG C C AG ATAAC A 
TATAAGAATA TATACTGATA TGTATACCCG- 




24 61 AAGTATGT C A AAAAGAGGTG TGCTATGAAG CAGCGTATTA CAGTGACAGT TGACAGCGAC 
2 521 AGCTATCAGT TGCTCAAGGC ATATATGATG TCAATATCTC CGGTCTGGTA AGCACAACCA 
2 581 TGCAGAATGA AGCCCGTCGT CTGCGTGCCG AACGCTGGAA AG C GGAAAAT CAGGAAGGGA 
2 641 TGGCTGAGGT CGCCCGGTTT ATTGAAATGA ACGGCTCTTT TGCTGACGAG AACAGGGACT 
2701 GGTGAAATGC AGTTTAAGGT TTAC AC C TAT AAAAGAGAGA GCCGTTATCG TCTGTTTGTG 
2761 GATGTACAGA GTGATATTAT TGACACGCCC GGGCGACGGA TGGTGATCCC CCTGGCCAGT 
2821 GCACGTCTGC TGTCAGATAA AGTCTCCCGT GAACTTTACC CGGTGGTGCA TATCGGGGAT 
2 8 81 GAAAGCTGGC GCATGATGAC CACCGATATG GCCAGTGTGC CGGTCTCCGT TATCGGGGAA 

2 941 GAAGTGGCTG ATCTCAGCCA CCGCGAAAAT G AC AT C AAAA AC G C C ATT AA CCTGATGTTC 

3 0 01 TGGGGAATAT AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA 
3 061 CTGGATATGT TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA 
3121 TATATTGATA TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTTG 
3181 ATGGCCGCTA AGTAAGTAAG ACGTCGAGCT CTAAGTAAGT AACGGCCGCC ACCGCGGTGG 
3241 AGCTTTGGAC TTCTTCGCCA GAGGTTTGGT CAAGTCTCCA ATCAAGGTTG TCGGCTTGTC 
3 3 01 TACCTTGCCA GAAATTTACG AAAAGATGGA AAAGGGTCAA AT C GTTGGT A GATACGTTGT 
33 61 TGACACTTCT AAATAAGCGA ATTTCTTATG ATTTATGATT TTTATTATTA AATAAGTTAT 
3 421 AAAAAAAATA AGTGTATACA AATTTTAAAG TGACTCTTAG GTTTTAAAAC GAAAATTCTT 
3 481 ATTCTTGAGT AACTCTTTCC TGTAGGTCAG GTTGCTTTCT CAGGTATAGC ATGAGGTCGC 
3 541 TCTTATTGAC CACACCTCTA CCGGCATGCC GAG CAAATGC CTGCAAATCG CTCCCCATTT 
36 01 CACCCAATTG TAGATATGCT AACTCCAGCA ATGAGTTGAT GAATCTCGGT GTGTATTTTA 
3 661 TGTCCTCAGA GGAC AATAC C TGTTGTAATC GTTCTTCCAC ACGGATCCCA ATTCGCCCTA 
3721 TAGTGAGTCG TATTACAATT CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC 
3 781 TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCT TTCGCCAGCT GGCGTAATAG 
3 841 CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGC CTGAATG GCGAATGGAC 
3 901 GCGCCCTGTA GCGGCGCATT AAGCGCGGCG GGTGTGGTGG TTACGCGCAG CGTGACCGCT 

3 961 ACACTTGCCA GCGCCCTAGC GCCCGCTCCT TTCGCTTTCT TCCCTTCCTT TCTCGCCACG 

4 021 TTCGCCGGCT TTCCCCGTCA AGCTCTAAAT CGGGGGCTCC CTTTAGGGTT CCGATTTAGT 
4 081 GCTTTACGGC ACCTCGACCC CAAAAAACTT GATTAGGGTG ATGGTTCACG TAGTGGGCCA 
4141 TCGCCCTGAT AGACGGTTTT TCGCCCTTTG ACGTTGGAGT CCACGTTCTT TAATAGTGGA 
42 01 CTCTTGTTCC AAACTGGAAC AACACTCAAC CCTATCTCGG TCTATTCTTT TGATTTATAA 
4261 GGGATTTTGC CGATTTCGGC CTATTGGTTA AAAAATGAGC TGATTTAACA AAAATTTAAC 
4 321 GCGAATTTTA ACAAAATATT AACGTTTACA ATTTCCTGAT GCGGTATTTT CTCCTTACGC 
4 381 ATCTGTGCGG TATTTCACAC CGCATATCGA CCGGTCGAGG AGAACTTCTA GTATATCCAC 
4441 ATACCTAATA TTATTGCCTT ATTAAAAATG GAATCGGAAC AATTACATCA AAATC C AC AT 
4 501 TCTCTTCAAA ATCAATTGTC CTGTACTTCC TTGTTCATGT GTGTTCAAAA ACGTTATATT 
4 561 TATAGGATAA TTATACTCTA TTTCTCAACA AGTAATTGGT TGTTTGGCCG AGCGGTCTAA 
4 621 GGCGCCTGAT TCAAGAAATA TCTTGACCGC AGTTAACTGT GGGAATACTC AGGTATCGTA 
4681 AGATGCAAGA GTTCGAATCT CTTAGCAACC ATTATTTTTT TCCTCAACAT AACGAGAACA 
4 741 CACAGGGGCG CTATCGCACA GAATCAAATT CGATGACTGG AAATTTTTTG TTAATTTCAG 
4 8 01 AGGTCGCCTG AC G CAT AT AC CTTTTTCAAC TGAAAAATTG GGAGAAAAAG GAAAGGTGAG 
4 861 AGGCCGGAAC CGGCTTTTCA TATAGAATAG AGAAGCGTTC ATGACTAAAT GCTTGCATCA 
4 921 CAATACTTGA AGTTGACAAT ATTATTTAAG GACCTATTGT TTTTTCCAAT AGGTGGTTAG 

4 981 CAATCGTCTT ACTTTCTAAC TTTTCTTACC TTTTACATTT CAGCAATATA TATATATATT 

5 041 TCAAGGATAT AC C ATT C T AA TGTCTGCCCC TATGTCTGCC C CTAAGAAG A TCGTCGTTTT 
5101 GCCAGGTGAC CACGTTGGTC AAGAAATCAC AGCCGAAGCC ATTAAGGTTC TTAAAG CTAT 

'5161 TTCTGATGTT CGTTCCAATG TCAAGTTCGA TTTCGAAAAT CATTTAATTG GTGGTGCTGC 
5221 TATCGATGCT AC AGGTGTC C CACTTCCAGA TGAGGCGCTG GAAGCCTCCA AGAAGGTTGA 
52 81 TGCCGTTTTG TTAGGTGCTG TGGGTGGTCC TAAATGGGGT ACCGGTAGTG TTAGAC CTGA 
534 Ov ACAAGGTTTA CTAAAAATCC GTAAAGAACT TCAATTGTAC GCCAACTTAA GACCATGTAA 
54 01 CTTTGCATCC GACTCTCTTT TAGACTTATC TCCAATCAAG C C AC AATTTG CTAAAGGTAC 
54 61 TGACTTCGTT GTTGTCAGAG AATTAGTGGG AGGTATTTAC TTTGGTAAGA GAAAGGAAGA 
5 521 CGATGGTGAT GGTGTCGCTT GGGATAGTGA AC AATAC AC C GTTCCAGAAG TGCAAAGAAT 
5581 CACAAGAATG GCCGCTTTCA TGGCCCTACA ACATGAGCCA CCATTGCCTA TTTGGTCCTT 
5641 GGATAAAGCT AATGTTTTGG CCTCTTCAAG ATT ATG GAGA AAAACTGTGG AGGAAACCAT 
5701 CAAGAACGAA TTCCCTACAT TGAAGGTTCA ACATCAATTG ATTGATTCTG CCGCCATGAT 
5761 CCTAGTTAAG AACCCAACCC ACCTAAATGG TATTATAATC ACCAGCAACA TGTTTGGTGA 
5 821 TATCATCTCC GATGAAGCCT CCGTTATCCC AGGTTCCTTG GGTTTGTTGC CATCTGCGTC 
5 881 CTTGGCCTCT TTGCCAGACA AGAACACCGC ATTTGGTTTG TACGAACCAT GCCACGGTTC- 





5941 TGCTCCAGAT TTGCCAAAGA ATAAGGTTGA CCCTATCGCC ACTATCTTGT CTGCTGCAAT 
6 001 GATGTTGAAA TTGTCATTGA ACTTGCCTGA AGAAGGTAAG GCCATTGAAG ATGCAGTTAA 
6 061 AAAGGTTTTG GATGCAGGTA TCAGAACTGG TGATTTAGGT GGTTCCAACA GTACCACCGA 
6121 AGTCGGTGAT GCTGTCGCCG AAGAAGTTAA GAAAATCCTT GCTTAAAAAG ATTCTCTTTT 
6181 TTTATGATAT TTGTACATAA ACTTTATAAA TGAAATTCAT AATAGAAACG ACACGAAATT 

62 41 ACAAAATGGA ATATGTTCAT AGGGTAGACG AAACTATATA CGCAATCTAC ATACATTTAT 

63 01 CAAGAAGGAG AAAAAGGAGG ATAGTAAAGG AATACAGGTA AGCAAATTGA TACTAATGGC 
6361 TCAACGTGAT AAGGAAAAAG AATTGCACTT TAACATTAAT ATTGACAAGG AGGAGGGCAC 
6421 CACACAAAAA GTTAGGTGTA ACAGAAAATC ATGAAACTAC GATTCCTAAT TTGATATTGG 

64 81 AGGATTTTCT CTAAAAAAAA AAAAATACAA CAAATAAAAA AC AC TCAATG ACCTGACCAT 
6541 TTGATGGAGT TTAAGTCAAT ACCTTCTTGA ACCATTTCCC ATAATGGTGA AAGTTCCCTC 
6 6 01 AAGAATTTTA CTCTGTCAGA AACGGCCTTA CGACGTAGTC GATATGGTGC ACTCTCAGTA 
6661 CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG 
6721 CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG 
6781 GG AG C TG CAT GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC 
6841 TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGGACGGAT 
6 9 01 CGCTTGCCTG TAACTTACAC GCGCCTCGTA TCTTTTAATG ATGGAATAAT TTGGGAATTT 

6 961 ACTCTGTGTT TATTTATTTT TATGTTTTGT ATTTGGATTT TAGAAAGTAA ATAAAGAAGG 

7 021 TAGAAGAGTT ACGGAATGAA GAAAAAAAAA TAAACAAAGG TTTAAAAAAT TTCAACAAAA 
7 081 AGCGTACTTT ACATATATAT TTATTAGACA AG AAAAG C AG ATTAAATAGA TATACATTCG 
7141 ATTAACGATA AGTAAAATGT AAAATCACAG GATTTTCGTG TGTGGTCTTC TACACAGACA 
72 01 AGATGAAACA ATTCGGCATT AATACCTGAG AG C AGGAAGA GCAAGATAAA AGGTAGTATT 
72 61 TGTTGGCGAT CCCCCTAGAG TCTTTTACAT CTTCGGAAAA CAAAAACTAT TTTTTCTTTA 
7321 ATTTCTTTTT TTACTTTCTA TTTTTAATTT ATATATTTAT ATTAAAAAAT TTAAATTATA 
7381 ATTATTTTTA TAGCACGTGA TGAAAAGGAC CCAGGTGGCA CTTTTCGGGG AAATGTGCGC 
7441 GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA 
7501 TAACCCTGAT AAATGCTTCA ATAATC TG C A GCTCTGGCCC GTGTCTCAAA ATCTCTGATG 
7561 TTACATTGCA CAAGATAAAA ATATATCATC ATGAACAATA AAACTGTCTG C TT AC AT AAA 
76 21 CAGTAATACA AGGGGTGTTA TGAGCCATAT TCAACGGGAA ACGTCTTGCT GGAGGCCGCG 
76 81 ATTAAATTCC AACATGGATG CTGATTTATA TG GGTAT AAA TGGGCTCGCG ATAATGTCGG 
7741 GCAATCAGGT GCGACAATCT TTCGATTGTA TGGGAAGCCC GATGCGCCAG AGTTGTTTCT 
78 01 GAAACATGGC AAAGGTAGCG TTGCCAATGA T G TT AC AG AT GAGATGGTCA GACTAAACTG 
78 61 GCTGACGGAA TTTATGCCTC TT C C G AC CAT CAAGCATTTT ATCCGTACTC CTGATGATGC 
7 921 ATGGTTACTC ACCACTGCGA TCCGCGGGAA AACAGCATTC CAGGTATTAG AAGAATATCC 

7 981 TGATTCAGGT GAAAATATTG TTGATGCGCT GGCAGTGTTC CTGCGCCGGT TGCATTCGAT 

8 041 TCCTGTTTGT AATTGTCCTT TTAACAGCGA TCGCGTATTT CGTCTCGCTC AGGCGCAATC 
8101 ACGAATGAAT AACGGTTTGG TTGATG CGAG TGATTTTGAT GACGAGCGTA ATGGCTGGCC 
8161 TGTTGAACAA GTCTGGAAAG AAATG CAT AC GCTTTTGCCA TTCTCACCGG ATTCAGTCGT 
8221 CACTCATGGT GATTTCTCAC TTGATAACCT TATTTTTGAC GAGGGGAAAT TAATAGGTTG 
82 81 TATTGATGTT GGACGAGTCG GAATCGCAGA CCGATACCAG GATCTTGCCA TCCTATGGAA 
8 341 CTGCCTCGGT GAGTTTTCTC CTTCATTACA GAAACGGCTT TTTCAAAAAT ATGGTATTGA 
84 01 TAATCCTGAT ATGAATAAAT TGCAGTTTCA TTTGATGCTC GATGAGTTTT TCTAATCAGA 
84 61 ATTGGTTAAT TGGTTGTAAC ACTGGCAGAG CATTACGCTG ACTTGACGGG ACGGCGCATG 
8 521 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC 
8 581 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 
/ 8 i 641 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG 
8 7 01 GTAACTGGCT T C AG C AG AG C GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA 
8 761 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 
8 821, CCAGTGGCTG CTGCCAGTGG C G ATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG 
88 81 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG 
8 941 GAGCGAACGA CCTACACCGA AC TG AG AT AC CTACAGCGTG AG CATTGAGA AAGCGCCACG 
9001 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 
9061 CGCACGAGGG AGCTTCCAGG GGGGAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 
9121 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCCGAG CCTATGGAAA 
9181 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 
9241 TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT 
93 01 GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA 
9361 GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC C G ATT C ATT A ATGCAGCTGG - 




9421 CACGACAGGT TTCCCGACTG GAAAGCGGGC AGTGAGCGCA ACGCAATTAA TGTGAGTTAC 
94 81 CTCACTCATT AGGCACCCCA GGCTTTACAC TTTATGCTTC CGGCTCCTAT GTTGTGTGGA 
9541 ATTGTGAGCG GATAACAATT TCACACAGGA AACAGCTATG ACCATGATTA CGCCAAGCTC 

96 01 GGAATTAACC CTCACTAAAG GGAACAAAAG CTGGTAC CGA TCCCGAGCTT TGCAAATTAA 
9661 AGCCTTCGAG CGTCCCAAAA CCTTCTCAAG CAAGGTTTTC AGTATAATGT TACATGCGTA 
9721 CACGCGTCTG TACAGAAAAA AAAGAAAAAT TTGAAATATA AATAACGTTC TTAATACTAA 

97 81 CATAACTATA AAAAAATAAA TAGGGACCTA GACTTCAGGT TGTCTAACTC CTTCCTTTTC 
9841 GGTTAGAGCG GATGTGGGGG GAGGGCGTGA ATGTAAGCGT GACATAACTA ATTACATGAT 
9901 ATCGACAAAG GAAAAGGGGC CTGTTTACTC ACAGGCTTTT TTCAAGTAGG TAATTAAGTC 
9961 GTTTCTGTCT TTTTCCTTCT TCAACCCACC AAAGGCCATC TTGGTACTTT TTTTTTTTTT 

10021 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
10081 TTTTTTTTTT TTTTTTTT tt TCATAGAAAT AATACAGAAG TAGATGTTGA ATTAGATTAA 
10141 ACTGAAGATA TATAATTTAT TGGAAAATAC ATAGAGCTTT TTGTTGATGC GCTTAAGCGA 
10201 TCAATTCAAC AACACCACCA GCAGCTCTGA TTTTTTCTTC AGCCAACTTG GAGAC GAATC 
10261 TAGCTTTGAC GATAACTGGA ACATTTGGAA TTCTACCCTT AC C CAAGATC TTACCGTAAC 
10321 CGGCTGCCAA AGTGTCAATA ACTGGAGCAG TTTCCTTAGA AG C AG ATTT C AAGTATTGGT 
10381 CTCTCTTGTC TTCTGGGATC AATGTCCACA ATTTGTCCAA GTTCAAGACT GGCTTCCAGA 
10441 AATGAGCTTG TTGCTTGTGG AAGTATCTCA TACCAACCTT AC C G AAATAA CCTGGATGGT 
10501 ATTTATCCAT GTTAATTCTG TGGTGATGTT GACCACCGGC CATACCTCTA CCACCGGGGT 
10561 GCTTTCTGTG CTTAC C GAT A CGACCTTTAC CGGCTGAGAC GTGACCTCTG TGCTTTCTAG 
10621 TCTTAGTGAA TCTGGAAGGC ATTCTTGATT AGTTGGATGA TTGTTCTGGG ATTTAATGCA 
10681 AAAATCACTT AAGAAGGAAA ATCAACGGAG AAAGCAAACG CCATCTTAAA TATACGGGAT 
10741 AC AG AT G AAA GGGTTTGAAC CTATCTGGAA AATAGCATTA AACAAGCGAA AAACTG C GAG 
108 01 GAAAATTGTT TGCGTCTCTG CGGGCTATTC ACGCGCCAGA GGAAAATAGG AAAAATAACA 
10861 GGGCATTAGA AAAATAATTT TGATTTTGGT AATGTGTGGG TCCTGGTGTA CAGATGTTAC 
10 921 ATTGGTTACA GTACTCTTGT TTTTGCTGTG TTTTTCGATG AATCTC C AAA ATGGTTGTTA 
10981 GCACATGGAA GAGTCACCGA TGCTAAGTTA TCTCTATGTA AGCTACGTGG CGTGACTTTT 
11041 GATGAAG C CG CACAAGAGAT ACAGGATTGG CAACTGCAAA TAGAATCTGG GGATCCCCCC 
11101 TCGAGATC CG GGATCGAAGA AATGATGGTA AATGAAATAG GAAATCAAGG AGCATGAAGG 
11161 CAAAAGACAA ATATAAGGGT CGAACGAAAA ATAAAGTGAA AAGTGTTGAT ATGATGTATT 
112 21 TGGCTTTGCG GCGCCGAAAA AACGAGTTTA CGCAATTGCA CAATCATGCT GACTCTGTGG 
11281 CGGACCCGCG CTCTTGCCGG CCCGGCGATA ACGCTGGGCG TGAGGCTGTG CCCGGCGGAG 
11341 TTTTTTGCGC CTGCATTTTC CAAGGTTTAC CCTGCGCTAA GGGGCGAGAT TGGAGAAGCA 
114 01 ATAAGAATGC CGGTTGGGGT TGCGATGATG ACGACCACGA CAACTGGTGT CATTATTTAA 
114 61 GTTGCCGAAA GAAC CTGAGT GCATTTGCAA CATGAGTATA CTAGAAGAAT GAGC CAAG AC 
11521 TTGCGAGACG CGAGTTTGCC GGTGGTGCGA AC AAT AG AG C GACCATGACC TTGAAGGTGA 
11581 GACGCGCATA ACCGCTAGAG TACTTTGAAG AGGAAACAG C AATAGGGTTG CTACCAGTAT 
11641 AAATAGACAG GTACATACAA CACTGGAAAT GGTTGTCTGT TTGAGTACGC TTTCAATTCA 
117 01 TTTGGGTGTG CAC 
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904 . . 1248 
1388 . . 1264 
1638 . .2297 
2417 . . 2501 
2639 . .2944 
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3831 . .4318 
4334 . .5176 
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Gene Encoded 
GAL 4 AD 
attRl 
CmR 

inactivated ccdA 

ccdB 

attR2 

fl (fl intergenic region) 

TRP1 

ampR 

pADH (yeast ADH promoter) 



1 TTCATTTGGG TGTGCACTTT ATTATGTTAC AATATGGAAG GGAACTTTAC ACTTCTCCTA 
61 TGCACATATA TTAATTAAAG TCCAATGCTA GTAGAGAAGG GGGGTAACAC CCCTCCGCGC 
121 TCTTTTCCGA TTTTTTTCTA AACCGTGGAA TATTTCGGAT ATCCTTTTGT TGTTTCCGGG 
181 TGTACAATAT GGACTTCCTC TTTTCTGGCA ACCAAACCCA TACATCGGGA TTCCTATAAT 
241 ACCTTCGTTG GTCTCCCTAA CATGTAGGTG GCGGAGGGGA GATATACAAT AGAACAGATA 
3 01 C C AG AC AAG A CATAATGGGC TAAACAAGAC TACACCAATT ACACTGCCTC ATTGATGGTG 
3 61 GTACATAACG AACTAATACT GTAGCCCTAG AC TTG AT AG C CATCATCATA TCGAAGTTTC 
421 ACTACCCTTT TTCCATTTGC CATCTATTGA AGTAATAATA GGCGCATGCA ACTTCTTTTC 
481 TTTTTTTTTC TTTTCTCTCT CCCCCGTTGT TGTCTCACCA TATCCGCAAT GACAAAAAAA 
541 ATGATGGAAG ACACTAAAGG AAAAAATTAA C G AC AAAG AC AG C AC C AAC A GATGTCGTTG 
601 TTCCAGAGCT GATGAGGGGT ATCTTCGAAC ACACGAAACT TTTTCCTTCC TTCATTCACG 
661 CACACTACTC TCTAATGAGC AACGGTATAC GGCCTTCCTT CCAGTTACTT GAATTTGAAA 
721 TAAAAAAAGT TTGCCGCTTT GCTATCAAGT ATAAATAGAC CTGCAATTAT TAATCTTTTG 
781 TTTCCTCGTC ATTGTTCTCG TTCCCTTTCT TCCTTGTTTC TTTTTCTGCA C AATATTT C A 
841 AG CTATAC C A AG C AT AC AAT CAACTCCAAG CTTATGCCCA AGAAGAAGCG GAAGGTCTCG 
901 AGCGGCGCCA ATTTTAATCA AAGTGGGAAT ATTGCTGATA GCTCATTGTC CTTCACTTTC 
961 ACTAACAGTA GCAACGGTCC GAACCTCATA ACAACTCAAA CAAATTCTCA AGCGCTTTCA 
1021 CAACCAATTG CCTCCTCTAA CGTTCATGAT AACTTCATGA ATAATGAAAT CACGGCTAGT 
1081 AAAATTGATG ATGGTAATAA TTCAAAAC C A CTGTCACCTG GTTGGACGGA CCAAACTGCG 
1141 TATAACG CGT TTGGAATCAC TACAGGGATG TTTAATAC C A CTACAATGGA TGATGTATAT 
1201 AACTATCTAT TCGATGATGA AGATACCCCA CCAAACCCAA AAAAAGAGGG TGGGTCGAAT 
12 61 CAAACAAGTT TGTACAAAAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA 
1321 TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA ACATATCCAG 
1381 TCACTATGGC GGCCGCTAAG TTGGCAGCAT CACCCGACGC ACTTTGCGCC GAATAAATAC 
1441 CTGTGACGGA AGATCACTTC G C AG AAT AAA TAAATC CTGG TGTCCCTGTT GATAC CGGGA 
15 01 AGCCCTGGGC CAACTTTTGG C G AAAATG AG ACGTTGATCG G C AC GT AAG A GGTTCCAACT 
1561 TTCACCATAA TGAAATAAGA TCACTACCGG GCGTATTTTT TGAGTTATCG AGATTTTCAG 
1621 GAGCTAAGGA AGCTAAAATG GAGAAAAAAA TCACTGGATA TACCACCGTT GATATATCCC 
1681 AATGGCATCG TAAAGAACAT TTTGAGGCAT TTCAGTCAGT TGCTCAATGT ACCTATAACC 
1741 AGACCGTTCA GCTGGATATT ACGGCCTTTT TAAAGAC CGT AAAGAAAAAT AAG C AC AAG T 
18 01 TTTATCCGGC CTTTATTCAC ATTCTTGCCC GCCTGATGAA TGCTCATCCG GAATTCCGTA 
'^861 TGGCAATGAA AGACGGTGAG CTGGTGATAT GGGATAGTGT TCACCCTTGT TACACCGTTT 
1921 TCCATGAGCA AACTGAAACG TTTTCATCGC TCTGGAGTGA AT AC C ACGAC GATTTCCGGC 
1981 AGTTTCTACA CATATATTCG CAAGATGTGG CGTGTTACGG TGAAAACCTG GCCTATTTCC 
2 041* CTAAAGGGTT TATTGAGAAT ATGTTTTTCG TCTCAGCCAA TCCCTGGGTG AGTTTC AC C A 
2101 GTTTTGATTT AAACGTGGCC AAT ATG G AC A ACTTCTTCGC CCCCGTTTTC ACCATGGGCA 
2161 AATATTATAC GCAAGGCGAC AAGGTGCTGA TGCCGCTGGC GATTCAGGTT CATCATGCCG 
2221 TCTGTGATGG CTTCCATGTC GGCAGAATGC TTAATGAATT ACAACAGTAC TGCGATGAGT 
2281 GGCAGGGCGG GGCGTAATCT AGAGGATCCG GCTTACTAAA AG C C AG AT AA CAGTATGCGT 
2 341 ATTTGCGCGC TGATTTTTGC GGTATAAGAA TATATACTGA T ATGT AT AC C CGAAGTATGT 
24 01 CAAAAAGAGG TGTGCTATGA AG C AG C GTAT TACAGTGACA GTTGACAGCG ACAGCTATCA 
24 61 GTTGCTCAAG GCATATATGA TG T C AAT AT C TCCGGTCTGG TAAGCACAAC CATGCAGAAT 
2 521 GAAGCCCGTC GTCTGCGTGC CGAACGCTGG AAAG C GGAAA ATCAGGAAGG GATGGCTGAG - 




2 581 GTCGCCCGGT TTATTGAAAT GAACGGCTCT TTTGCTGACG AGAACAGGGA CTGGTGAAAT 
2641 GCAGTTTAAG GTTTACACCT ATAAAAGAGA GAGCCGTTAT CGTCTGTTTG TGGATGTACA 
2 7 01 GAGTGATATT ATTGACACGC CCGGGCGACG GATGGTGATC CCCCTGGCCA GTGCACGTCT 
27 61 GCTGTCAGAT AAAGTCTCCC GTGAACTTTA CCCGGTGGTG CATATCGGGG ATGAAAGCTG 
2821 GCGCATGATG ACCACCGATA TGGCCAGTGT GCCGGTCTCC GTTATCGGGG AAGAAGTGGC 
2881 TGATCTCAGC CACCGCGAAA ATGACATCAA AAACGCCATT AACCTGATGT TCTGGGGAAT 

2 941 ATAAATGTCA GGCTCCCTTA TAC AC AG C C A GTCTGCAGGT CGACCATAGT GACTG GAT AT 

3 0 01 GTTGTGTTTT ACAGTATTAT GTAGTCTGTT TTTTATGCAA AATCTAATTT AATATATTGA 
3 061 TATTTATATC ATTTTACGTT TCTCGTTCAG CTTTCTTGTA CAAAGTGGTT TGATGGCCGC 
3121 TAAGTAAGTA AGACGTCGAG CTCTAAGTAA GTAACGGCCG CCACCGCGGT GGAGCTTTGG 
3181 ACTTCTTCGC CAGAGGTTTG GTCAAGTCTC CAATCAAGGT TGTCGGCTTG TCTACCTTGC 
3241 CAGAAATTTA CGAAAAGATG GAAAAGGGTC AAATCGTTGG TAGATACGTT GTTGACACTT 
33 01 CTAAATAAGC GAATTTCTTA TGATTTATGA TTTTTATTAT TAAATAAGTT ATAAAAAAAA 

33 61 TAAGTGTATA CAAATTTTAA AGTGACTCTT AGGTTTTAAA ACGAAAATTC TTATTCTTGA 
3421 GTAACTCTTT CCTGTAGGTC AGGTTGCTTT CTCAGGTATA GCATGAGGTC GCTCTTATTG 

34 81 ACCACACCTC TACCGGCATG CCGAGCAAAT GCCTGCAAAT CGCTCCCCAT TTCACCCAAT 
3 541 TGTAGATATG CTAACTCCAG CAATGAGTTG ATGAATCTCG GTGTGTATTT TATGTCCTCA 
36 01 GAGGACAATA CCTGTTGTAA TCGTTCTTCC ACACGGATCC CAATTCGCCC TATAGTGAGT 

36 61 CGTATTACAA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 
3721 CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG 

37 81 CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG ACGCGCCCTG 

38 41 TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 
3 9 01 CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 

3 961 CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG 

4 021 GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG 
4 081 ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 
4141 CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 
42 01 GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT 
4261 TAACAAAATA TTAACGTTTA CAATTTCCTG ATGCGGTATT TTCTC CTTAC GCATCTGTGC 
4 321 GGTATTTCAC ACCGCAGGCA AGTGCACAAA CAATACTTAA ATAAATACTA CTCAGTAATA 
4381 ACCTATTTCT TAG C ATTTTT GACGAAATTT GCTATTTTGT TAGAGTCTTT TAC AC C ATTT 
4441 GTCTCCACAC CTCCGCTTAC ATCAACACCA ATAAC G C CAT TTAAT CTAAG CGCATCACCA 
4 5 01 ACATTTTCTG GCGTCAGTCC ACCAGCTAAC ATAAAATGTA AGCTTTCGGG GCTCTCTTGC 
4561 CTTCCAACCC AGTCAGAAAT CGAGTTCCAA TCCAAAAGTT CACCTGTCCC ACCTGCTTCT 
4621 GAATCAAACA AGGGAATAAA CGAATGAGGT TTCTGTGAAG CTGCACTGAG TAGTATGTTG 
4681 CAGTCTTTTG G AAAT AC GAG T CTTTTAATA ACTGGCAAAC CGAGGAACTC TTGGTATTCT 
4 741 TGCCACGACT CATCTCCATG CAGTTGGACG ATATCAATGC CGTAATCATT GACCAGAGCC 
48 01 AAAACATCCT CCTTAGGTTG ATTACGAAAC ACGCCAACCA AGTATTTCGG AGTGCCTGAA 
4861 CTATTTTTAT ATGCTTTTAC AAGACTTGAA ATTTTC CTTG C AATAAC CGG GTCAATTGTT 
4 921 CTCTTTCTAT TGGGCACACA TATAATACCC AGCAAGTCAG CATCGGAATC TAGAGCACAT 

4 981 TCTGCGGCCT CTGTGCTCTG CAAGCCGCAA ACTTTCACCA ATGGAC CAGA ACTAC CTGTG 

5 041 AAATTAATAA CAGACATACT CCAAGCTGCC TTTGTGTGCT TAATCACGTA TACTCACGTG 
5101 CTCAATAGTC ACCAATGCCC TCCCTCTTGG CCCTCTCCTT TTCTTTTTTC GACCGAATTA 
5161 ATTCTTAATC GGCAAAAAAA GAAAAGCTCC GGATCAAGAT TGTACGTAAG GTGACAAGCT 
5221 ATTTTTCAAT AAAGAATATC TTCCACTACT GCCATCTGGC GTCATAACTG CAAAGTACAC 
's^Sl ATATATTACG ATGCTGTCTA TTAAATGCTT CCTATATTAT ATATATAGTA ATGTCGTTTA 
5 341 TGGTG CACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC CCGACACCCG 
54 01 CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA 
5 4 6 1 * GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC 
5 521 G C GAG AC G AA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG 
5581 GTTTCTTAGG ACGGATCGCT TGCCTGTAAC TTACACGCGC CTCGTATCTT TTAATGATGG 
5641 AATAATTTGG GAATTTACTC TGTGTTTATT TATTTTTATG TTTTGTATTT GGATTTTAGA 

57 01 AAGTAAATAA AGAAGGTAGA AGAGTTACGG AATGAAGAAA AAAAAATAAA CAAAGGTTTA 
5761 AAAAATTT C A AC AAAAAG C G TACTTTACAT ATATATTTAT TAGACAAGAA AAGCAGATTA 
5 821 AATAGATATA CATTCGATTA ACGATAAGTA AAATGTAAAA TCACAGGATT TTCGTGTGTG 

58 81 GTCTTCTACA CAGACAAGAT GAAACAATTC GGCATTAATA CCTGAGAGCA GGAAGAGCAA 

5 941 GATAAAAGGT AGTATTTGTT GGCGATCCCC CTAGAGTCTT TTACATCTTC GGAAAACAAA 

6 001 AACTATTTTT TCTTTAATTT CTTTTTTTAC TTTCTATTTT TAATTTATAT ATTTATATTA- 





6061 AAAAATTTAA ATTATAATTA TTTTTATAGC ACGTGATGAA AAGGACCCAG GTGGCACTTT 

lx\\ GTGCGCGGAA cccctatttg tttatttttc taaatacatt CAAATATGTA 

TCCGCTCATG AGACAATAAC C CTG ATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 

H A gagSttcaa catttccgtg tcgcccttat tccctttttt GCGGCATTTT GCCTTCCTGT 
till SSgctcac CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT tgggtgcacg 
6361 agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 

6421 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG 
till TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
6541 TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 
6601 CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG 
6661 AGGACCGAAG GAGCTAACCG CTTTTTTT C A CAACATGGGG GATCATGTAA CTCGCCTTGA 
6721 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT AC C AAACGAC GAGCGTGACA CCACGATGCC 
6781 TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 
6841 CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 
6901 GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG 
6961 CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 
7021 GACGGGCAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 
lOBl ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
7141 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 
72 01 CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 

72 61 AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 
7321 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 

73 81 AACTGGCTTC AGCAGAGCGC AG AT AC C AAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG 
7441 CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 
7501 AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 
7561 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 
7621 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 
76 81 TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
7741 CACGAGGGAG CTTCCAGGGG GGAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 
78 01 CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCCGAGCC TATGGAAAAA 
78 61 CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT 
7921 CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTG AG CTG A 
7981 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 
8041 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
8101 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTACCT 
8161 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCCTATGT TGTGTGGAAT 
8221 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTCGG 
82 81 AATTAACCCT CACTAAAGGG AACAAAAGCT GGGTACCGGG CCCCCCCTCG AGATCCGGGA 
8341 TCGAAGAAAT GATGGTAAAT GAAATAGGAA ATCAAGGAGC ATGAAGGCAA AAGACAAATA 
84 01 TAAGGGTCGA ACGAAAAATA AAGTGAAAAG TGTTGATATG ATGTATTTGG CTTTGCGGCG 
84 61 CCGAAAAAAC GAGTTTACGC AATTGCACAA TCATGCTGAC TCTGTGGCGG ACCCGCGCTC 
8521 TTGCCGGCCC GGCGATAACG CTGGGCGTGA GGCTGTGCCC GGCGGAGTTT TTTGCGCCTG 
8 581 CATTTTCCAA GGTTTACCCT GCGCTAAGGG GCGAGATTGG AGAAGCAATA AGAATGCCGG 
8 641 TTGGGGTTGC GATGATGACG ACCACGACAA CTGGTGTCAT TATTTAAGTT GCCGAAAGAA 
87 01 CCTGAGTGCA TTTG C AAC AT GAGTATACTA GAAGAATGAG CCAAGACTTG CGAGACGCGA 

' 8761 GTTTGCCGGT GGTGCGAACA AT AG AG C G AC CATGACCTTG AAGGTGAGAC GCGCATAACC 
8 821 GCTAGAGTAC TTTGAAGAGG AAAC AG C AAT AGGGTTGCTA C C AGTAT AAA TAGACAGGTA 
8 8 81 CATACAACAC TGGAAATGGT TGTCTGTTTG AGTACGCTTT CAA 





His6 carboxy-fusion vector, T7 promoter^ 
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256 



ate ccg cga aat 
tag ggc get tta 



11 Prffr^mfir 



gat eta dter ttc aaa^at gtt tjtt teg act tgc tot ttg cat ttt act 



eta gat dac aag ttt 



taa tac gac tea eta tag gga gate cac aac ggt ttc cct 
act atg ctg agt gat atta cdt ctg ^|y^-^^ 



gta caa aaa age tga acg aga aac gta aaa tga tat 



Of 



/A 



Odd E 



v/ 



1838 



1939 



1990 



ttt tta tgc aaa ate taa ttt aat 
aaa aat acg ttt tag att aaa tta 

//q**R2- . tr Y K 



tct cgt tea get t't jc ~ttg tac aaa 
-,aga gca agt cga aag aac atg| ttt 

A H H_ H H Z P EV 6^ 

cat cab cat cac etc gat ga?y calr 

^gta gtg__gta__gtg^ gag eta etc gtt 



ata ttg ata 
tat aac tat 

V V , X 
att 
taa 



gtg gtg 
cac cac 

taa eta gca 
att gat cgt 



ttt ata tea 
aaa tat agt 

M € Y 

atg teg tac 
tac age atg 

taa ccc ctt 
att ggg gaa 



ttt tac gtt 
aaa atg caa 

Y H H 

tac cat cac 

ggg gee tct 
ccc egg aga 
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pDEST23 6264 bp 



Location (Base Nos . ) 
285 . . 161 
394 . . 1053 
1173 . . 1257 
1395 . . 1700 
1741 . . 1865 
1883 . . 1911 
2574 . . 3.434 
3583 . .4222 



Gene Encoded 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

his6 

ampR 

ori 



1 TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG 
61 ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC 
121 GACTCACTAT AGGGAGACCA CAACGGTTTC CCTCTAGATC ACAAGTTTGT ACAAAAAAGC 
181 TGAACGAGAA ACGTAAAATG ATATAAATAT CAATATATTA AATTAGATTT TGCATAAAAA 
241 ACAGACTACA TAATACTGTA AAACACAACA TATCCAGTCA CTATGGCGGC CGCATTAGGC 
3 01 ACCCCAGGCT TTACACTTTA TGCTTCCGGC TCGTATAATG TGTGGATTTT GAGTTAGGAT 
361 CCGGCGAGAT TTTCAGGAGC TAAGGAAGCT AAAATGGAGA AAAAAATCAC TGG AT AT AC C 
421 ACCGTTGATA TATCCCAATG GCATCGTAAA GAACATTTTG AGGCATTTCA GTCAGTTGCT 
481 CAATGTACCT AT AAC C AG AC CGTTCAGCTG GATATTACGG CCTTTTTAAA GACCGTAAAG 
Q 541 AAAAATAAGC ACAAGTTTTA TCCGGCCTTT ATTCACATTC TTGCCCGCCT GATGAATGCT 

CI 6 01 CATCCGGAAT TCCGTATGGC AATGAAAGAC GGTGAGCTGG TGATATGGGA TAGTGTTCAC 

yl 6 61 CCTTGTTACA CCGTTTTCCA TGAGCAAACT GAAACGTTTT CATCGCTCTG GAGTGAATAC 

fa-- 721 CACGACGATT TCCGGCAGTT TCTACACATA TATTCGCAAG ATGTGGCGTG TTACGGTGAA 

%i 781 AACCTGGCCT ATTTCCCTAA AGGGTTTATT GAGAATATGT TTTTCGTCTC AGCCAATCCC 

JZ 841 TGGGTGAGTT TCACCAGTTT TGATTTAAAC GTGGCCAATA TGGACAACTT CTTCGCCCCC 

j~p 9 01 GTTTTCACCA TGGGCAAATA TTATACGCAA GGCGACAAGG TGCTGATGCC GCTGGCGATT 

gi" 961 CAGGTTCATC ATGCCGTCTG TGATGGCTTC CATGTCGGCA GAATGCTTAA TGAATTACAA 

1021 CAGTACTGCG ATGAGTGGCA GGGCGGGGCG TAAACGCGTG GATCCGGCTT ACTAAAAGCC 
% z 10 81 AGATAACAGT ATGCGTATTT GCGCGCTGAT TTTTGCGGTA TAAGAATATA TACTGATATG 

;y: ii4i tatacccgaa gtatgtcaaa aagaggtgtg ctatgaagca gcgtattaca gtgacagttg 

S 12 01 acagcgacag ctatcagttg ctcaaggcat atatgatgtc aatatctccg gtctggtaag 

y 12 61 cacaaccatg cagaatgaag cccgtcgtct gcgtgccgaa cgctggaaag cggaaaatca 

Hf 13 21 ggaagggatg gctgaggtcg cccggtttat tgaaatgaac ggctcttttg ctgacgagaa 

S 13 81 cagggactgg tgaaatgcag tttaaggttt acacctataa aagagagagc cgttatcgtc 

O 1441 tgtttgtgga tgtacagagt gatattattg acacgcccgg gcgacggatg gtgatccccc 

15 01 TGGCCAGTGC ACGTCTGCTG TCAGATAAAG TCTCCCGTGA ACTTTACCCG GTGGTGCATA 

15 61 TCGGGGATGA AAGCTGGCGC ATGATGACCA CCGATATGGC CAGTGTGCCG GTCTCCGTTA 
1621 TCGGGGAAGA AGTGGCTGAT CTCAGCCACC GCGAAAATGA CATCAAAAAC GCCATTAACC 

16 81 TGATGTTCTG GGGAATATAA ATGTCAGGCT CCCTTATACA CAGCCAGTCT GCAGGTCGAC 
1741 CATAGTGACT GGATATGTTG TGTTTTACAG TATTATGTAG TCTGTTTTTT ATGCAAAATC 
1801 TAATTTAATA TATTGATATT TATATCATTT TACGTTTCTC GTTCAGCTTT CTTGTACAAA 
1861 GTGGTGATTA TGTCGTACTA CCATCACCAT CACCATCACC TCGATGAGCA AT AAC TAGCA 
1921 TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA 

'l^l TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG 

2 041 TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG 

2101 TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT 

2161* GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT 

22 21 GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT 

2281 ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC 

2 341 GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT 

24 01 TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG 

24 61 GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC 
2 521 TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA 

25 81 TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG 

2 641 CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG *" 





2 701 GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC 
2 761 GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG 
2821 ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT 
2 881 ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG 

2 941 CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC 

3 001 CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT 
3 061 GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG 
3121 CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC 
3181 AACAATTAAT AG AC TGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC 
3241 TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA 
3 301 TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG 
3361 GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA 
3421 TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC 
34 81 TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA 
3 541 TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG AT C AAAGGAT 
36 01 CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC 
3 661 TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG 
3 721 GCTTCAGCAG AG CGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC 
3 781 ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG 
3 841 CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG 
3 901 ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA 

3 961 CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG 

4 021 AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA 
4 081 GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT 
4141 GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA 

42 01 GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC 
4261 CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG 

43 21 CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC 
43 81 TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC 
4441 TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGC CAG TATACACTCC GCTATCGCTA 
4501 CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG 
4 561 GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG 
4621 TGTCAGAGGT TTTCACCGTC ATCAC CGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA 
46 81 GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT 
4741 TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT 
48 01 TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA 
4 861 CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA 
4 921 CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC 

4 981 ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG 

5 041 CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC 
5101 AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT 
5161 TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA 
5221 AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT 
52 81 GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG 
5341 ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG 

'54 01 GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG 
5461 AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG 
5521 CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA 
5 581 GGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT 
5641 GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA 
57 01 TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG 
5761 CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC 
5 821 CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA 
5881 ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA 

5 941 TGACCCAGAG CGCTGCCGGC ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA 

6 001 GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC 
6 061 TCAAGGGCAT CGGTCGATCG ACGCTCTCCC TTATGCGACT C C TG C AT TAG GAAGCAGCCC 
6121 AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAG ATG - 





6181 GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC 
6241 ATGAGCCCGA AGTGGCGAGC CCGA 
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GST carboxy-fusion vector, T7 promoter 



ate gag ate teg ate ccg cga aat 
tag etc tag age tag ggc get tta 



7 1 ^p^^- 



taa tac gaC tea eta t 

att atg ctg agt gat a 




52 



aac ggt ttc cct eta gat 
ttg cca aag gga gat eta 



gac cac 
ctg gtg 



ic aa<T cxt |crta caa aa a age tga acg aga aac 
tg tte aaa cat gtt 



it 



l|tt teg act tgc tct. ttg f { 



/A 



■y/ 



X735 tea £ 

yt aaa atg caa 



^ f c^gfch fact cgt tea ' "get tticJEg 

^ juvi-i n TS>-r a a ^ 



Tal 



K V V 



c aaa gtg g 



17QS 



aga gea acrt c ga aa*a aa~n »rgi t-ht cac cab 
tat So aaa Att aag ggc ctt gtg caa ccc 



X M 5 

att atg tec 
taa tac agg 



oLt atk eta aat tat tig aaa' att aag 'ggc ctt gtg caa ccc act cga ctt 
gga til g" cS ata all ttt taa ttc ccg gaa cac gtt ggg tg» get gaa 




pDEST24 6961 bp 

r^ at -^ m aH e Noa.) Gene Encoded 



195. .71 
304 . . 963 



attRl 
CmR 



1083.. 1167 inactivated ccdA 

1305.. 1610 ccdB 

1651. .1775 attR2 

1783 . .2451 GST 

3181.. 4041 ampR 

4190.. 4829 ori 

1 ATCGAGATCT CGATCCCGCG AAATTAATAC GACTCACTAT AGGGAGACCA CAACGGTTTC 
A SSagI^c ACAAGTTTGT ACAAAAAAGC TGAACGAGAA ACGTAAAATG ATATAAATAT 

,11 caaSSSta JSraLrar tgcataaaaa acagactaca taatactgta aaacacaaca 
i8i tSccaSca ™gcggc cgcattaggc accccaggct ttacacttta tgcttccggc 
2I1 ™tSaatg~ SLgatttt gagttaggat ccggcgagat tttcaggagc taaggaagct 
III aaaatggaga aaaaaatcac tggatatacc accgttgata tatcccaatg gcatcgtaaa 

36^ SSSSSS AGGCATTTCA GTCAGTTGCT CAATGTACCT ATAACCAGAC CGTTCAGCTG 
III GATATTACGG JcStTAAA GACCGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT 

481 a^ScaSc Sgcccgcct gatgaatgct catccggaat tccgtatggc aatgaaagac 
til Sotgagctgg tgatatggga tagtgttcac ccttgttaca ccgttttcca tgagcaaact 
sol gaaacgSS catcgctctg gagtgaatac cacgacgatt tccggcagtt tctacacata 
eel ?a^cgcaag atgtggcgtg ttacggtgaa aacctggcct atttccctaa agggtttatt 
ill gagaatatgt ttttcgtctc agccaatccc tgggtgagtt tcaccagttt tgatttaaac 
III SgScaata tggacaactt cttcgccccc gttttcacca tgggcaaata ttatacgcaa 

III GGCGACAAGG TGCTGATGCC GCTGGCGATT CAGGTTCATC ATGCCGTCTG TGATGGCTTC 
901 CAtScSca SSgCTTAA TGAATTACAA CAGTACTGCG ATGAGTGGCA GGGCGGGGCG 
961 TAAACGCGTG GATCCGGCTT ACTAAAAGCC AGATAACAGT ATGCGTATTT GCGCGCTGAT 
rill SJJSSaiA TAAGAATATA TACTGATATG TATACCCGAA GTATGTCAAA AAGAGGTGTG 

llll SaSaagca gcgtattaca gtgacagttg acagcgacag ctatcagttg ctcaaggcat 
1141 atatgatgtc aatatctccg gtctggtaag cacaaccatg cagaatgaag cccgtcgtct 
llll Scotgccgaa Sctggaaag cggaaaatca ggaagggatg gctgaggtcg cccggtttat 

llll TGAAATGAAC GGCTCTTTTG CTGACGAGAA CAGGGACTGG TGAAATGCAG TTTAAGGTTT 

IcaISSSaa Sgagagagc cgttatcgtc tgtttgtgga tgtacagagt gatattattg 
llll aScgcSSg gcgacggatg gtgatccccc tggccagtgc acgtctgctg tcagataaag 

llll ScCCGTGA ACTTTACCCG GTGGTGCATA TCGGGGATGA AAGCTGGCGC ATGATGACCA 

llll ccgISSggc" Sagtgtgccg gtctccgtta tcggggaaga agtggctgat ctcagccacc 

1561 GCGAAA^GA CA^CAAAAAC GCCATTAACC TGATGTTCTG GGGAATATAA ATGTCAGGCT 
H A S^IcA CAGCCAGTCT GCAGGTCGAC CATAGTGACT GGATATGTTG TGTTTTACAG 
llll SSSgSg tctgtttttt ATGCAAAATC TAATTTAATA TATTGATATT TATATCATTT 
i74i ScSttSc gttcagcttt cttgtacaaa gtggtgatta tgtcccctat actaggttat 

llll TGGAAAATTA AGGGCCTTGT GCAACCCACT CGACTTCTTT TGGAATATCT TGAAGAAAAA. 

llll l^^Zc ATTTGTATGA GCGCGATGAA GGTGATAAAT GGCGAAACAA AAAGTTTGAA 

1921 SggSSgg AGTTTCCCAA TCTTCCTTAT TATATTGATG gtgatgttaa attaacacag 

illl TCTATGGCCA TCATACGTTA TATAGCTGAC AAGCACAACA TGTTGGGTGG TTGTCCAAAA 

loll gagStgcag agatttcaat gcttgaagga gcggttttgg atattagata cggtgtttcg 
llol SaaSgS? a^gtaaaga ctttgaaact ctcaaagttg attttcttag caagctacct 

llll GAAATGCTGA AAATGTTCGA AGATCGTTTA TGTCATAAAA CATATTTAAA TGGTGATCAT 

222M^SStC ctgacttcat gttgtatgac gctcttgatg ttgttttata CATGGACCCA 
2281 atSgcctgg ATGCGTTCCC aaaattagtt tgttttaaaa aacgtattga agctatccca 
llll JaaStgIta agtacttgaa atccagcaag tatatagcat ggcctttgca gggctggcaa 
2T0I gSacg?Sg gtggtggcga ccatcctcca aaatcggatc tggttccgcg tccatgggga 
2461 ?cc5gctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata 
llll actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg 
2581 aaSataSc" ggSatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg 

lell CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA- 





2701 GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC 
276^ TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA 
2821 IgccSScc TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC ^CATTGTTAG 
llll A?SSScA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA 
H A ScSSSS GATAAGCTGT CAAACATGAG AATTCTTGAA GACGAAAGGG CCTCGTGATA 
sooi cgcSISt Staggttaa TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT 
3061 SJcSggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 
3121 ^cSSS tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 
llll SgagStc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 

3241 ™SSc ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA 

llll cgISgggtt acatcgaact ggatctcaac AGCGGTAAGA tccttgagag ttttcgcccc 

3361 GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC 
3421 CGTGTTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 
llll g^gStact CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA 
llll TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC 
llol GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT 
3661 gSSSK AACCGGAGCT GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG 

37" cctcSgcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct 

3781 TCcSSJc" AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC 
3841 TCGGCCCTTC SgCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT 

llol cgcggtSca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac 
llll acgacgSgga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc 
4021 StgaSa agcattggta actgtcagac caagtttact catatatact ttagattgat 
till Saaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 
Ic^Stcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc 

llol AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA 

llll cca?cgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag 
till gta!c?ggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta 

till GGC^ACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA 

llll ccaSggotg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag 
till tSccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg 
till gScgSSa cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg 

till CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG 
tell CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC 
till CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA 
till AACGCCAGcl ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG 
till SStScS CGTTATCCCC TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT 

till gaSccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa 
till SgJgcSga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatatat 

sill GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT ACACTCCGCT 
llol ScgSacgt GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC 

5^1 SgacSgS Stctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag 

5221 CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC TGCGGTAAAG 
llll ScScAGCG TGGTCGTGAA GCGATTCACA GATGTCTGCC TGTTCATCCG CGTCCAGCTC 

llll gtSag^tc tccagaagcg ttaatgtctg gcttctgata AAGCGGGCCA TGTTAAGGGC 

llll GGTTTTTTCC TGTTTGGTCA CTGATGCCTC CGTGTAAGGG GGATTTCTGT TCATGGGGGT 
5461 AATGATACCG ATGAAACGAG AGAGGATGCT CACGATACGG GTTACTGATG ATGAACATGC 
5521 CCGGTTACTG GAACGTTGTG AGGGTAAACA ACTGGCGGTA TGGATGCGGC GGGACCAGAG 
5581 AAAAAT C AC T CAGGGTCAAT GCCAGCGCTT CGTTAATACA GATGTAGGTG TTCCACAGGG 
sI^tagISagJag CATCCTGCGA TGCAGATCCG GAACATAATG GTGCAGGGCG CTGACTTCCG 

I'll cgISSSa ctttacgaaa cacggaaacc gaagaccatt catgttgttg ctcaggtcgc 

Sill AGACGTTTTG CAGCAGCAGT CGCTTCACGT TCGCTCGCGT ATCGGTGATT CATTCTGCTA 

llll aSaotaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca cgatcatgcg 
llll ctcSSS caggacccaa cgctgcccga gatgcgccgc gtgcggctgc tggagatggc 
llll ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa 

60" TTGATTGGCT CCAATTCTTG GAGTGGTGAA TCCGTTAGCG AGGTGCCGCC GGCTTCCATT 

loll cIgotcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca gacaaggtat 

6l" AGGGCGGCGC CTACAATCCA TGCCAACCCG TTCCATGTGC TCGCCGAGGC GGCATAAATC 



6181 GCCGTGACGA TCAGCGGTCC AGTGATCGAA GTTAGGCTGG TAAGAGCCGC GAGCGATCCT 
6241 TGAAGCTGTC CCTGATGGTC GTCATCTACC TGCCTGGACA GCATGGCCTG CAACGCGGGC 
6 3 01 ATCCCGATGC CGCCGGAAGC GAGAAGAATC ATAATGGGGA AGGCCATCCA GCCTCGCGTC 
6361 GCGAACGCCA GCAAGACGTA GCCCAGCGCG TCGGCCGCCA TGCCGGCGAT AATGGCCTGC 
6421 TTCTCGCCGA AACGTTTGGT GGCGGGACCA GTGACGAAGG CTTGAGCGAG GGCGTGCAAG 
6481 ATTCCGAATA CCGCAAGCGA CAGGCCGATC ATCGTCGCGC TCCAGCGAAA GCGGTCCTCG 
6 541 CCGAAAATGA CCCAGAGCGC TGCCGGCACC TGTCCTACGA GTTGCATGAT AAAGAAGACA 
6 6 01 GTCATAAGTG CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT GACTGGGTTG 
6661 AAGGCTCTCA AGGGCATCGG TCGATCGACG CTCTCCCTTA TGCGACTCCT GCATTAGGAA 
6721 GCAGCCCAGT AGTAGGTTGA GGCCGTTGAG CACCGCCGCC GCAAGGAATG GTGCATGCAA 
67 81 GGAGATGGCG CCCAACAGTC CCCCGGCCAC GGGGCCTGCC ACCATACCCA CGCCGAAACA 
6841 AGCGCTCATG AGCCCGAAGT GGCGAGCCCG ATCTTCCCCA TCGGTGATGT CGGCGATATA 
6 901 GGCGCCAGCA ACCGCACCTG TGGCGCCGGT GATGCCGGCC ACGATGCGTC CGGCGTAGAG 
6961 G 





Thioredoxin carboxy-fiision vector, T7 promoter 
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Location (Base Nos . ) 
844 . . 720 
953. .1612 
1732 . . 1816 
1954 . .2259 
2300 . . 2424 
2432 . .2794 



Gene Encoded 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

trx 



1 CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGC GAACGCCAGC 
61 AAGACGTAGC CCAGCGCGTC GGCCGCCATG CCGGCGATAA TGGCCTGCTT CTCGCCGAAA 
121 CGTTTGGTGG CGGGACCAGT GACGAAGGCT TGAGCGAGGG CGTGCAAGAT TCCGAATACC 
181 GCAAGCGACA GGCCGATCAT CGTCGCGCTC CAGCGAAAGC GGTCCTCGCC GAAAATGACC 
241 CAGAGCGCTG CCGGCACCTG TCCTACGAGT TGCATGATAA AGAAGACAGT CATAAGTGCG 

3 01 GCGACGATAG TCATGCCCCG CGCCCACCGG AAGGAGCTGA CTGGGTTGAA GGCTCTCAAG 
36^ GGCATCGGTC GATCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG 
421 TAGGTTGAGG CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC 

4 81 CAACAGTCCC CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG 
541 CCCGAAGTGG CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC 

6 01 CGCACCTGTG GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC 
66^ GATCCCGCGA AATTAATACG ACTCACTATA GGGAGACCAC AACGGTTTCC CTCTAGATCA 
721 CAAGTTTGTA CAAAAAAGCT GAACGAGAAA CGTAAAATGA T AT AAAT AT C AATATATTAA 

7 81 ATTAGATTTT GCATAAAAAA CAGACTACAT AATACTGTAA AACACAACAT ATCCAGTCAC 
841 TATGGCGGCC GCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATAATGT 
901 GTGGATTTTG AGTTAGGATC CGGCGAGATT TTCAGGAGCT AAGGAAGCTA AAATGGAGAA 
961 AAAAATCACT GG AT AT AC C A CCGTTGATAT ATCCCAATGG CATCGTAAAG AACATTTTGA 

1021 GGCATTTCAG TCAGTTGCTC AATGTACCTA TAACCAGACC GTTCAGCTGG ATATTACGGC 
llll CTTTTTAAAG ACCGTAAAGA AAAATAAGCA CAAGTTTTAT CCGGCCTTTA TTCACATTCT 
1141 TGCCCGCCTG ATGAATGCTC ATCCGGAATT CCGTATGGCA ATGAAAGACG GTGAGCTGGT 
1201 GATATGGGAT AGTGTTCACC CTTGTTACAC CGTTTTCCAT GAGCAAACTG AAACGTTTTC 
12 61 ATCGCTCTGG AGTGAAT AC C ACGACGATTT CCGGCAGTTT CTACACATAT ATTCGCAAGA 
1321 TGTGGCGTGT TACGGTGAAA ACCTGGCCTA TTTCCCTAAA GGGTTTATTG AGAATATGTT 
1381 TTTCGTCTCA GCCAATCCCT GGGTGAGTTT CACCAGTTTT GATTTAAACG TGGCCAATAT 
1441 GGACAACTTC TTCGCCCCCG TTTTC AC CAT GGGCAAATAT TATACGCAAG GCGACAAGGT 
1501 GCTGATGCCG CTGGCGATTC AGGTTCATCA TGCCGTCTGT GATGGCTTCC ATGTCGGCAG 
1561 AATGCTTAAT GAATTACAAC AGTACTGCGA TGAGTGGCAG GGCGGGGCGT AAACGCGTGG 
III! ATCCGGCTTA CTAAAAGCCA GATAACAGTA TGCGTATTTG CGCGCTGATT TTTGCGGTAT 
16 81 AAGAATATAT ACTGATATGT ATACCCGAAG TATGTCAAAA AGAGGTGTG C TATGAAGCAG 
1741 CGTATTACAG TGACAGTTGA CAGCGACAGC TATCAGTTGC TCAAGGCATA TATGATGTCA 
1801 ATATCTCCGG TCTGGTAAGC ACAACCATGC AGAATGAAGC CCGTCGTCTG CGTGCCGAAC 
18 61 GCTGGAAAGC GGAAAATCAG GAAGGGATGG CTGAGGTCGC CCGGTTTATT GAAATGAACG 
1921 GCTCTTTTGC TGACGAGAAC AGGGACTGGT GAAATGCAGT TTAAGGTTTA CACCTATAAA 
1981 AGAGAGAGCC GTTATCGTCT GTTTGTGGAT GTACAGAGTG ATATTATTGA CACGCCCGGG 
2 041 CGACGGATGG TGATCCCCCT GGCCAGTGCA CGTCTGCTGT CAGATAAAGT CTCCCGTGAA 
'2101 CTTTACCCGG TGGTGCATAT CGGGGATGAA AGCTGGCGCA TGATGACCAC CGATATGGCC 
2161 AGTGTGCCGG TCTCCGTTAT CGGGGAAGAA GTGGCTGATC TCAGCCACCG CGAAAATGAC 
2221 ATCAAAAACG CCATTAACCT GATGTTCTGG GGAATATAAA TGTCAGGCTC CCTTATACAC 
22 81 AGCCAGTCTG CAGGTCGACC ATAGTGACTG GATATGTTGT GTTTTACAGT ATTATGTAGT 
2341 CTGTTTTTT A TGCAAAATCT AATTTAATAT ATTGATATTT ATATCATTTT ACGTTTCTCG 
2401 TTCAGCTTTC TTGTACAAAG TGGTGATTAT GAGCGATAAA ATTATTCACC TGACTGACGA 
2461 CAGTTTTGAC ACGGATGTAC TCAAAGCGGA CGGGGCGATC CTCGTCGATT TCTGGGCAGA 
2 521 GTGGTGCGGT CCGTGCAAAA TGATCGCCCC GATTCTGGAT GAAATCGCTG ACGAATATCA 
2581 GGGCAAACTG ACCGTTGCAA AACTGAACAT CGAT C AAAAC CCTGGCACTG CGCCGAAATA 
2 641 TGGCATCCGT GGTATCCCGA CTCTGCTGCT GTTCAAAAAC GGTGAAGTGG CGGCAACCAA 
2 7 01 AGTGGGTGCA CTGTCTAAAG GTCAGTTGAA AGAGTTCCTC GACGCTAACC TGGCCGGTTC 
2 761 TGGTTCTGGT GATGACGATG ACAAGGTACC CGGGGATCGA TCCGGCTGCT AACAAAGCCC 




2 821 GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA ACTAGCATAA CCCCTTGGGG 
2 881 CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG AACTATATCC GG AT AT C C AC 

2 941 AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG CTCCAAGTAG CGAAG CGAGC 

3 001 AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA GAACGGGTGC GCATAGAAAT 
3 061 TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC TGGCGATGCT GTCGGAATGG 
3121 ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA AGCCTATGCC TACAGCATCC 
3181 AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG ATTTCATACA CGGTGCCTGA 
3 241 CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA GCTTATCGAT GATAAGCTGT 
3 301 CAAACATGAG AATTCTTGAA GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA 
3361 TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG 
3421 AACCCCTATT TGTTTATTTT T CTAAATAC A TTCAAATATG TATCCGCTCA TGAGACAATA 
3481 AC C CTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG 
3 541 TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC 
3 6 01 GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT 
3 661 GGATCTCAAC AG CGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 
3 721 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG CCGGGCAAGA 
3 7 81 GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC 
3 841 AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG C C AT AAC CAT 

3 901 GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC 
3961 CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 
4021 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA TGGCAACAAC 

O 4 081 GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA 

yO 4141 CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 

yl 42 01 GTTTATTGCT GATAAATCTG GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT 

U s 42 61 GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 

SI 43 21 TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA 

43 81 ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT 
g~ 4441 TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA 

~ 4 5 01 GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC 

4 561 TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 
:L 4621 TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC 
J** 4681 GCAGATACCA AATACTGTCC TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC 

4 741 TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 
^ 4 8 01 CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG 

4861 GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA 
O 4 921 ACTGAGATAC CTACAGCGTG AGC TATGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC 

O 4 981 GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG 

5 041 GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 
5101 ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT 
5161 TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC 
5221 TGATTCTGTG GATAAC CGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG 
52 81 AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCTGA TGCGGTATTT 
5341 TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAT GGTGCACTCT CAGTACAATC 

54 01 TGCTCTGATG CCGCATAGTT AAGCCAGTAT ACACTCCGCT ATCGCTACGT GACTGGGTCA 
5461 TGGCTGCGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC 
'552 1 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT 

55 81 CACCGTCATC ACCGAAACGC GCGAGGCAGC TGCGGTAAAG CTCATCAGCG TGGTCGTGAA 
5641 GCGATTCACA GATGTCTGCC TGTTCATCCG CGTCCAGCTC GTTGAGTTTC TCCAGAAGCG 
5 7 01^ TTAATGTCTG GCTTCTGATA AAGCGGGCCA TGTTAAGGGC GGTTTTTTCC TGTTTGGTCA 
5761 CTGATGCCTC CGTGTAAGGG GGATTTCTGT TCATGGGGGT AATGATACCG ATGAAACGAG 
5 821 AGAGGATGCT CACGATACGG GTTACTGATG ATGAACATGC CCGGTTACTG GAACGTTGTG 
58 81 AGGGTAAACA ACTGGCGGTA TGGATGCGGC GGG AC C AG AG AAAAATCACT CAGGGTCAAT 

5 941 GCCAGCGCTT CGTTAATACA GATGTAGGTG TTCCACAGGG TAGCCAGCAG CATCCTGCGA 
6001 TGCAGATCCG GAACATAATG GTGCAGGGCG CTGACTTCCG CGTTTCCAGA CTTTACGAAA 

6 061 CACGGAAACC GAAGACCATT CATGTTGTTG CTCAGGTCGC AGACGTTTTG CAGCAGCAGT 
6121 CGCTTCACGT TCGCTCGCGT ATCGGTGATT CATTCTGCTA ACCAGTAAGG CAACCCCGCC 
6181 AGCCTAGCCG GGTCCTCAAC GACAGGAGCA CG AT CAT G C G CACCCGTGGC CAGGACCCAA 
6241 CGCTGCCCGA GATGCGCCGC GTGCGGCTGC TGGAGATGGC GGACGCGATG GATATGTTCT * — 



6301 GCCAAGGGTT GOTTTGCGCA *™T= TCC™ CC^TTCTTa 
6361 GAGTGGTGAA TCCGTTAGCG ^GCCGCC AGGGCGGCGC CTACAATCCA 

6421 CCATGCACCG CGACGCAACG ™™ GCCGTGACGA TCAGCGGTCC 

6481 TGCCAACCCG TTCCATGTGC ^CGCCOMK TGAAGCTGTC CCTGATGGTC 

GTCATCTACC E£E£ ™- SES»C AXCCCGA^C CG 



pDEST26 His6 Amino Fusion in pCMV Sport-neo 

Vector 

600 S2 S S 3 £ £ SS 1 3 4 s - s s is s s ss 

~ s s a g s a s g.JS 5 = 35 = = 5 5gL 



753 



709 ^Wt'gss a» tct ata taa gca gaf etc gtt tag tga acc g;c 
,,cca ccc tec aga tat att cgt etc ga g caa ate act tgg c^g 



aga 
tct age gga 



S 55 S S SS IS £2 3S SE S3 55 S S S f 

ggt egg agg oct gag ate gga tec gac K A /kffTZtj / 

■ i »_i i_L 1-4 V "S i — < — / H UL- „" Jr^t- n 3a i-r»a aaa 7 



855 



ggn ugy oyy — - Vr ^ f , l/ K ^ t *Y* i 

& & & & & ?! itfe^llsl; 



BspLU11 I 7092 
Aff III 7092 



Sst\ 518 
SnaB1292 1 Bbs I 581 
Ssf II 623 
Xba I 662 
Not \ 796 



Eam1105 I 6199 



Xmn I 5600 



Nsp V 4909 
R$r II 4743 




Bg/ll 1564 
Bsf1107 I 1650 



S/f I 1985 
Aval 1986 
Sma I 1986 
Sal I 2245 



Me t 2475 



Tth^^^ i 4343 

Nar! 4227 
Afl II 4082 
H/nd III 4050 
Stu I 4031 
SseR 14015 
Sff I 3980 



Be/ 1 2751 
Mun I 2844 
Hpa I 2855 



Dra III 3346 
Kpn I 3684 

SexA I 3801 



pDEST26 7481 bp 



T.onatio^ ( Rasft Nos.) 
492 . . 509 
619. .519 
752. .1411 
1531. .1615 
1753 . .2058 
2099. .2223 
2409. .2771 
2966 . .3421 
3485. .3903 
3948 . .4742 
4806 . .4854 
5265. .6125 
6274. .6913 
7344 . .385 



Hpne Encoded 

his6 

attRl 

CmR 

inactivated ccdA 

ccdB 

attR2 

SV40 polyA 

fl intergenic region 

SV4 0 promoter 

neo 

polyA 

Apr 

ori 

CMV promoter 



1 GTAAACTGCC CACTTGGCAG G T A_ C = OC CCC = 

61 CGTCAATGAC GGTAAATGGC CCGCCTGGCA ^GCCCAG ^ TGCGGTTTTG 

121 TCCTACTTGG CAGTACATCT ^GTATTAGT CATCGCT GGATTTCCAA GTCTCCACCC 

181 GCAGTACATC AATGGGCGTG GJTAGCGGTT TGACTCAC cqgGACTTTC CAAAATGTCG 
241 CATTGACGTC AATGGGAGTT TGTTTTGGCA CCAAAATCAA AGGT CT ATAT 

301 TAACAACTCC GCCCCATTGA CGCAAATGGG CGGTAGGCGT ^G^ GCTGTTTTGA 
361 AAGCAGAGCT CGTTTAGTGA ACCGTCAGAT CGCCTGGA^ CCGCGGACCA 
421 CCTCCATAGA ^ACACCGGG ^CGATCCAG ^ttGTAC AAAAAAGCTG 

481 TGGCGTACTA CCATCACCAT CAJCATCACT J™ TTAGATTTTG CATAAAAAAC 

541 AACGAGAAAC GTAAAATGAT ATAAATATCA ATATATTAAA CATTAGGCAC 
601 AGACTACATA ATACTGTAAA JCACAACATA TCCAGTCACT ^ GTTAGGATCC 

661 CCCAGGCTTT ACACTTTATG CTTCCGGCTC AAAATCACTG GATATACCAC 

721 GGCGAGATTT TCAGGAGCTA AGGAAGCTAA AWTOGAGAAA QCATTTCAGT CAGTTGCTCA 
781 CGTTGATATA TCCCAATGGC JTCGTAAAGA JCATTTTGAG GC ^ 
841 ATGTACCTAT ^ACCAGACCG TTCAGCTGGA TATTACG gcccgcctga TGAATGCTCA 
• 901 AAATAAGCAC AAGTTTTATC C ^CCTTTAT JCACA ATATGGGATA GTGTTCACCC 

961 TCCGGAATTC CGTATGGCAA TGAAAGACGG TGAGCTGGTG A GTGAATACCA 
1021 TTGTTACACC GTTTTCCATG JGCAAACTGA AACGTTTTCA JCGC A cGGTGAAAA 
1081 CGACGATTTC CGGCAGTTTC TACACATATA ^GCM^AT CCAATCCCTG 
1141 CCTGGCCTAT TTCCCTAAAG ^TTATTGA ^^TTT ™ G TCGCCCCCGT 
1201 GGTGAGTTTC AC C AGTTTTG ATTTAAACGT GGCCAATATG CTGATGCCGC TGGCGATTCA 
1261 TTTCACCATG GGCAAATATT ATACGCAAGG JGACAAGGTG AATTACAACA 
1321 GGTTCATCAT GCCGTCTGTG JTGGCTTCCA ^GGCAGA ip AAAAGG CAG 
1381 GTACTGCGAT GAGTGGCAGG ^GGGGCGTA ^ AGAATATATA CTGATATGTA 

14 41 ATAACAGTAT GCGTATTTGC ^CGCTGATTT ^ GTATTACAGT GACAGTTGAC 

1501 TACCCGAAGT ATGTCAAAAA <^™3CT ^GAft TATCTCCGGT CTGGTAAGCA 

,1561 AGCGACAGCT ATCAGTTGCT C^GGCATAT JTGATGTCAA GftAAATCAGG 
1621 CAACCATGCA GAATGAAGCC CGTCGTCTGC ^ CTCTTTTGCT GACGAGAACA 

1681 AAGGGATGGC TGAGGTCGCC CGGTTTATTG £££££ GAGAG AGCCG TTATCGTCTG 
!741 GGGACTGGTG ££££££ ACGCCCGGGC GACGGATGGT GATCCCCCTG 

1801 - TTTGTGG ATG TACAGAGTGA TA TTAT1W£ TTTACCCGGT GGTGCATATC 

1861 GCCAGTGCAC GTCTGCTGTC J^AAAGTC ^CGTGAAC J CTCCGTTATC 
1921 GGGGATGAAA GCTGGCGCAT GATGACCACC TCAAAAACGC CATTAACCTG 

19 81 GGGGAAGAAG TGGCTGATCT CAGCCACCGC G ™^™ GCCAGTCTGC AGGTCGACCA 

2041 ATGTTCTGGG GAATATAAAT ^CAGGCTCC CTTATACACA. GCAAAATCTA 
2101 TAGTGACTGG ATATGTTGTG JTTTACAGTA TTATGTAGTC fG^ACAAAGT 
2161 ATTTAATATA TTGATATTTA ^ATCATTTTA CGTTT AGTGAGTCGT ATTATAAGCT 

aSaSggc SSSS cScg T T Sg Sgggaaaa C TGCT AGC TT GGGA TCTTT G 





2341 TGAAGGAACC TTACTTCTGT GGTGTGACAT AATTGGACAA ACTACCTACA ^AGATTTAAA 



2401 GCTCTAAGGT AAATATAAAA TTTTTAAGTG TATAATGTGT TAAACTAGCT GCATATGCTT 

276^ TCACAAATAA MCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG 

2821 
2881 
2941 



2461 GCTGCTTGAG AGTTTTGCTT ACTGAGTATG ATTTATGAAA ATATTATACA CAGGAGCTAG 
2641 TTGTTAACTT GTTTATOGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT 



2 521 TGATTCTAAT TGTTTGTGTA TTTTAGATTC ACAGTCCCAA GGCTCATTTC AGGCCCCTCA 
2 581 GTCCTCACAG TCTGTTCATG ATCATAATCA GCCATACCAC ATTTGTAGAG GTTTTACTTG 
CTTTAAAAAA CCTCCCACAC CTCCCCCTGA ACCTGAAACA TAAAATGAAT GCAATTGTTG 



TATCTTATCA TGTCTGGATC GATCCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC 
GGTTTGCGTA TTGGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG 
TTGCGCAGCC TGAATGGCGA ATGGGACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT 
3001 GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC 
3 061 GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC TCTAAATCGG 
3121 GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA AAAACTTGAT 
3181 TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG CCCTTTGACG 
3241 TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC ACTCAACCCT 
3 301 ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA TTGGTTAAAA 
3 3 61 AATGAGCTGA TTTAACAAAT ATTTAACGCG AATTTTAACA AAATATTAAC GTTTACAATT 
3421 TCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTC AC AC C GCATACGCGG 
34 81 ATCTGCGCAG CACCATGGCC TGAAATAACC TCTGAAAGAG GAACTTGGTT AGGTACCTTC 
3 541 TGAGGCGGAA AGAACCAGCT GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC 
3601 TCCCCAGCAG GCAGAAGTAT GCAAAGCATG C ATCT C AATT AGTCAGCAAC CAGGTGTGGA 
3661 AAGTCCCCAG GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA 
3721 ACCATAGTCC CGCCCCTAAC TCCGCCCATC CCGCCCCTAA CTCCGCCCAG TTCCGCCCAT 
37 81 TCTCCGCCCC ATGGCTGACT AATTTTTTTT ATTTATGCAG AGGCCGAGGC CGCCTCGGCC 
3841 TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG 
3 901 CTTGATTCTT CTGACACAAC AGTCTCGAAC TTAAGGCTAG AGCCACCATG ATTGAACAAG 

3 961 ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG 

4 021 CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC 
4 081 CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG 
4141 CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA 
4201 CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT 
4261 CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA 
4 321 CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC 
4 3 81 GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC 
4441 TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG 
45 01 TCGTG AC CCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 
4 561 GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA 
4621 CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 
4681 GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 
4 741 GAGCGGGACT CTGGGGTTCG AAATGAC CGA CCAAGCGACG CCCAACCTGC CATCACGATG 
4801 GCCGCAATAA AATATCTTTA TTTTCATTAC ATCTGTGTGT TGGTTTTTTG TGTGAATCGA 
4 861 TAGCGATAAG GATCCGCGTA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT 
4 921 TAAGCCAGCC CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC 
4 981 CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAGGTTTT 

'5'041 CACCGTCATC ACCGAAACGC GCGAGACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG 
5101 TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG CACTTTTCGG GGAAATGTGC 
5161 GCGGAACCCC TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC 
5221 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT 
52 81 'TCCQTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG 
5341 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG GGTTACATCG 
54 01 AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA 
54 61 TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC 
5521 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG 
5581 TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA 
5641 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA CCGAAGGAGC 
57 01 TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAAC CGG 
5761 AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC GATGCCTGTA GCAATGGCAA 





58 21 CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA 
5881 TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG 
S94X GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT ATCATTGCAG 
6 001 CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT CTACACGACG GGGAGTCAGG 
6 061 CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG TGCCTCACTG ATTAAGCATT 
6121 GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT 
6181 AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC 
6241 GTGAGTTTTC GTTCCACTGA GCGTC AG AC C CCGTAGAAAA GATCAAAGGA TCTTCTTGAG 
63 01 ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA AAAACCACCG CTACCAGCGG 
6361 TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC GAAGGTAACT GGCTTCAGCA 
6421 GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCAAGA 
6481 ACTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG GCTGCTGCCA 
6 541 GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG ATAGTTACCG GATAAGGCGC 
66 01 AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG CTTGGAGCGA ACGACCTACA 
6661 CCGAACTGAG ATACCTACAG CGTGAGCATT GAGAAAGCGC CACGCTTCCC GAAGGGAGAA 
6721 AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG AGAGCGCACG AGGGAGCTTC 
6781 CAGGGGGAAA CGCCTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC TGACTTGAGC 
6 841 GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG GAAAAACGCC AGCAACGCGG 
6 901 CCTTTTTACG GTTCCTGGCC TTTTGCTGGC CTTTTGCTCA CATGTTCTTT CCTGCGTTAT 

6 961 CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG AGCTG AT AC C GCTCGCCGCA 

7 021 GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC GGAAGAGCGC CCAATACGCA 
7081 AACCGCCTCT CCCCGCGCGT TGGCCGATTC ATTAATGCAG AGCTTGCAAT TCGCGCGTTT 
7141 TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT 

72 01 GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCC AC C TG 
7261 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT AGTACGAGGC 
7321 CCTTTCACTC ATTAGATGCA TGTCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 

73 81 CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
7441 ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC G 





pDEST27 GST Amino Fusion inpCMV«port-iiea 

Vector 



600 



4- 



nac ggt 
ntg eca 



ggg agg tct ' a ta^Eaagca gag etc get tag tga ace 
ccc tic aga tar. att cgt etc g ag caa ate act tgg dag 




651 



702 



753 



804 



cct gga 
gga cct 

gat cca 
eta ggt 

ggt tat 
cca ata 

tat ctt 
ata gaa 



gac gec 
ctg egg 

gec tec 
egg agg 

tgg aaa 
acc ctt 

gaa gaa 
ctt ctt 



ate cac get gtt ttg acc tec ata gaa gac acc ggg 
tag gtg cga caa aac tgg agg tat ctt^ctg tgg ccc 

qoa etc tag cct agg ccg egg acc |*tg gec cct ata 
lit gag ate gga tec ggc gec tgg ^^^^ft 

att aag ggc ctt gtg caa ccc act cga ctt ctt ttg 
taa ttl ccg gaa cac gtt ggg tga get gaa gaa aac 



3 gaa cac gtt ggg tga g 

AST VV^-r^v^ — 

a^iag cat ttg tat gag c 



1365 it t tt ggt 
f* aaa cca 

X 

1416 tea aca 



ggt ggc 
cca ccg 



agt 



tgt 



agt tt^£ 
tea aac 



aaa tat gaa^ag cat ccg cat g« 3 cgc gat gaa ggt 
ttc ata ctt etc gta aac ata etc gcg eta ctt cca 

V P fl. S 
aac cat cct cca aaa teg gat ctg gtt ccg cgt tct 
ctg gta gga ggt ttt age eta gac caa ggc gca aga 
Y * K A ^ 



acc 
tgg 

L 

eta 

gaa 
ctt 

eta 



aga 
tct 



tac aaa aaa get gaa cga gaa acg 
Afcq ttU ttt cga ctt_ gct, ctt toe f/ 



SnaB I 292 



F: ?. 

Z 3 S 



Eam1105l 6841 



Rsr II 5385 



Sst I 518 
Bbs I 581 
SsMI623 
EcoN I 638 



Bsg I 897 

Swa II 1056 



Xbal 1304 
Not I 1438 




EcoR I 1759 
PfftA I 1985 



•Sg/ II 2206 
0 S fl1O7l 2292 



Srfl2627 
Ava I 2628 
Sma I 2628 
Sal I 2887 



7WI11 14985 
Nar I 4869 

Afl II 4724 
Hind 111 4692. 
Stu I 4673 
BseR I 4657 
Sfi I 4622 

SexAl 4443 



Nnel3117 



Muni 3486 
Hpa I 3497 



Dra III 3988 



Kpn I 4326 



130 . . 


793 


803 . . 


927 


1036 . 


.1695 


1815 . 


.1899 


2037 . 


.2342 


2383 . 


.2507 


2693 . 


. .3055 


3250 . 


. .3705 


3769 


. .4187 


4232 


. .5026 


5090 


. . 5138 


5549 


. .6409 


6558 


. .7197 


7628 


. .27 



pDEST27 8123 bp (rotated to position 7800) 

r.o^Mon ( B*se Nos . ) Gene Encoded 

GST 
attRl 
CmR 

inactivated ccdA 
ccdB 
attR2 

SV4 0 polyA 

fl intergenic region 
SV4 0 promoter 
neo 
polyA 
Apr 
ori 

CMV promoter 

1 ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC ACGCTGTTTT 

si ga"?cca" gaagacaccg GGACCGATCC agcctccgga ctctagccta ^ccgcggac 

ill CATGGCCCCT ATACTAGGTT ATTGGAAAAT TAAGGGCCTT GTGCAACCCA CTCGACTTCT 

III SSgaatS S?gaagaaa AATATGAAGA GCATTTGTAT gagcgcgatg aaggtgataa 
III ATGGCGAAAC aaaaagtttg aattgggttt ggagtttccc aatcttcctt attatattga 

TGGTGATGTT AAATTAACAC AGTCTATGGC CATCATACGT TATATAGCTG -AAGCACAA 
361 CATGTTGGGT GGTTGTCCAA AAGAGCGTGC AGAGATTTCA ATGCTTGAAG ^CGGTTTT 
III gSatIttaga TACGGTGTTT CGAGAATTGC ATATAGTAAA GACTTTGAAA CTCTCAAAGT 
til SSSSSJ AGCAAGCTAC CTGAAATGCT GAAAATGTTC GAAGATCGTT TATGTCATAA 
til ZcZattII aa^gatc ATGTAACCCA TCCTGACTTC ATGTTGTATG ACGCTCTTGA 
III iSSSJS ^catggacc caatgtgcct ggatgcgttc ccaaaattag TTTGTTTTAA 
III IaaacgSt? Sagctatcc cacaaattga taagtacttg aaatccagca agtatatagc 
nil a^ScSSg SgIgSggc aagccacgtt tggtggtggc gaccatcctc caaaatcgga 

III ZritZcli CGTTCTAGAT CAACAAGTTT GTACAAAAAA GCTGAACGAG AAACGTAAAA 

III tcSSaaa? atcaatatat taaattagat tttgcataaa aaacagacta cataatactg 
sol JSIaSSaa SSSccagt cactatggcg gccgcattag gcaccccagg ctttacactt 

III ^A^ScS GCTCGTATAA TGTGTGGATT TTGAGTTAGG ATCCGGCGAG ATTTTCAGGA 

J" gSaaSaag Saaaatgga gaaaaaaatc actggatata ccaccgttga tatatcccaa 

?Sct?SS AAGAACATTT TGAGGCATTT CAGTCAGTTG CTCAATGTAC CTATAACCAG 

llll accgt^Sc ^attac ggccttttta aagaccgtaa agaaaaataa gcacaagttt 

llol JScCGGCCT TTATTCACAT TCTTGCCCGC CTGATGAATG CTCATCCGGA ATTCCGTATG 
llll GCAATGAAAG ACGGTGAGCT GGTGATATGG GATAGTGTTC ACCCTTGTTA CACCGTTTTC 
! CATGAGCAAA SgAAACGTT XTCATCGCTC TGGAGTGAAT ^-CGA ™™» 
T3 81 TTTCTACACA TATATTCGCA AGATGTGGCG TGTTACGGTG AAAACCTGGC CTATTTCCCT 

111 IIacggttta TTGAGAATAT gtttttcgtc TCAGCCAATC cctgggtgag TTTCACCAGT 
llol JJSSS acSggISaa tatggacaac ttcttcgccc ccgttttcac catgggcaaa 
llll ?a??a™ aaggcgacaa ggtgctgatg ccgctggcga ttcaggttca tcatgccgtc 
'llll totcntggct tccatgtcgg cagaatgctt aatgaattac aacagtactg cgatgagtgg 
lell Sgggcgggg cSSaagatc tggatccggc ttactaaaag ccagataaca gtatgcgtat 
I n il Sgcgcgctg atttttgcgg tataagaata tatactgata tgtatacccg aagtatgtca 
llol aaaagaggtg Sctatgaag cagcgtatta cagtgacagt tgacagcgac agctatcagt 
llll JSScaaggc ATATATGATG tcaatatctc cggtctggta agcacaacca tgcagaatga 
llll agcccotcS Sgcgtgccg aacgctggaa agcggaaaat caggaaggga tggctgaggt 
llll ScccggtS attgaaatga acggctcttt tgctgacgag aacagggact ggtgaaatgc 
loll agStaaggt ttacacctat aaaagagaga gccgttatcg tctgtttgtg gatgtacaga 
ITol gIgXSSSS tgacacgccc gggcgacgga tggtgatccc cctggccagt gcacgtctgc 
llll ?SS2ataa aotScccgt gaactttacc cggtggtgca tatcggggat gaaagctggc 
m£ IZtoaTg^ cIScgatatg gccagtgtgc cggtctccgt tatcggggaa gaagtggctg 

llll SScAGCCA CCGCGAAAAT GACATCAAAA ACGCCATTAA CCTGATGTTC TGGGGAATAT 



2341 AAATGTCAGG CTCCCTTATA CACAGCCAGT CTGCAGGTCG ACCATAGTGA CTGGATATGT 
24 01 TGTGTTTTAC AGTATTATGT AGTCTGTTTT TTATGCAAAA TCTAATTTAA TATATTGATA 
2461 TTTATATCAT TTTACGTTTC TCGTTCAGCT TTCTTGTACA AAGTGGTTGA TCGCGTGCAT 
2 521 GCGACGTCAT AGCTCTCTCC CTATAGTGAG TCGTATTATA AGCTAGGCAC TGGCCGTCGT 
2581 TTTACAACGT CGTGACTGGG AAAACTGCTA GCTTGGGATC TTTGTGAAGG AACCTTACTT 
2641 CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT 
2701 AAAATTTTTA AGTGTATAAT GTGTTAAACT AGCTGCATAT GCTTGCTGCT TGAGAGTTTT 
2761 GCTTACTGAG TATGATTTAT GAAAATATTA TACACAGGAG CTAGTGATTC TAATTGTTTG 
2 821 TGTATTTTAG ATTCACAGTC CCAAGGCTCA TTTCAGGCCC CTCAGTCCTC ACAGTCTGTT 
2 881 C ATG AT CAT A ATC AGC CAT A CCACATTTGT AGAGGTTTTA CTTGCTTTAA AAAACCTCCC 

2 941 ACACCTCCCC CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 

3 001 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 
3 061 TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 
3121 GATCGATCCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGCT 
3181 GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGC AGCCTGAATG 
3241 GCGAATGGGA CGCGCCCTGT AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA 

33 01 GCGTGACCGC TACACTTGCC AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT 
3361 TTCTCGCCAC GTTCGCCGGC TTTCCCCGTC AAGCTCTAAA TCGGGGGCTC CCTTTAGGGT 
3421 TCCGATTTAG TGCTTTACGG CACCTCGACC CCAAAAAACT TGATTAGGGT GATGGTTCAC 

34 81 GTAGTGGGCC ATCGCCCTGA TAGACGGTTT TTCGCCCTTT GACGTTGGAG TCCACGTTCT 
3 541 TTAATAGTGG ACTCTTGTTC CAAACTGGAA CAACACTCAA CCCTATCTCG GTCTATTCTT 
3601 TTGATTTATA AGGGATTTTG CCGATTTCGG CCTATTGGTT AAAAAATGAG CTGATTTAAC 
3 661 AAATATTTAA CGCGAATTTT AACAAAATAT TAACGTTTAC AATTTCGCCT GATGCGGTAT 
3 721 TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAC GCGGATCTGC GCAGCACCAT 
3 781 GGCCTGAAAT AACCTCTGAA AGAGGAACTT GGTTAGGTAC CTTCTGAGGC GGAAAGAACC 
3 841 AGCTGTGGAA TGTGTGTCAG TTAGGGTGTG GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA 
3 901 GTATGCAAAG CATGCATCTC AATTAGTCAG CAACCAGGTG TGGAAAGTCC CCAGGCTCCC 

3 961 CAGCAGGCAG AAGTATGCAA AGCATGCATC TCAATTAGTC AGC AAC CAT A GTCCCGCCCC 

4 021 TAACTCCGCC CATCCCGCCC CTAACTCCGC CCAGTTCCGC CCATTCTCCG CCCCATGGCT 
4 081 GACTAATTTT TTTTATTTAT GCAGAGGCCG AGGCCGCCTC GGCCTCTGAG CTATTCCAGA 
4141 AGTAGTGAGG AGGCTTTTTT GGAGGCCTAG GCTTTTGCAA AAAGCTTGAT TCTTCTGACA 
42 01 CAACAGTCTC GAACTTAAGG CTAGAGCCAC CATGATTGAA CAAGATGGAT TGCACGCAGG 
4261 TTCTCCGGCC GCTTGGGTGG AGAGGCTATT CGGCTATGAC TGGGCACAAC AGACAATCGG 
4321 CTGCTCTGAT GCCGCCGTGT TCCGGCTGTC AGCGCAGGGG CGCCCGGTTC TTTTTGTCAA 
4 381 GACCGACCTG TCCGGTGCCC TGAATGAACT GCAGGACGAG GCAGCGCGGC TATCGTGGCT 
4441 GGCCACGACG GGCGTTCCTT GCGCAGCTGT GCTCGACGTT GTCACTGAAG CGGGAAGGGA 
4 501 CTGGCTGCTA TTGGGCGAAG TGCCGGGGCA GGATCTCCTG TCATCTCACC TTGCTCCTGC 
4 561 CGAGAAAGTA TCCATCATGG CTGATGCAAT GCGGCGGCTG CATACGCTTG ATCCGGCTAC 
4 621 CTGCCCATTC GACCACCAAG CGAAACATCG CATCGAGCGA GCACGTACTC GGATGGAAGC 
46 81 CGGTCTTGTC GATCAGGATG ATCTGGACGA AGAGCATCAG GGGCTCGCGC CAGCCGAACT 
4 741 GTTCGCCAGG CTCAAGGCGC GCATGCCCGA CGGCGAGGAT CTCGTCGTGA CCCATGGCGA 
4 8 01 TGCCTGCTTG CCGAATATCA TGGTGGAAAA TGGCCGCTTT TCTGG ATT C A TCGACTGTGG 
4 861 CCGGCTGGGT GTGGCGGACC GCTATCAGGA CATAGCGTTG GCTACCCGTG ATATTGCTGA 
4921 AGAGCTTGGC GGCGAATGGG CTGACCGCTT CCTCGTGCTT TACGGTATCG CCGCTCCCGA 
4 981 TTCGCAGCGC ATCGCCTTCT ATCGCCTTCT TGACGAGTTC TTCTGAGCGG GACTCTGGGG 
'5041 TTCGAAATGA CCGACCAAGC GACGCCCAAC CTGCCATCAC GATGGCCGCA ATAAAATATC 

5101 TTTATTTTCA TTACATCTGT GTGTTGGTTT TTTGTGTGAA TCGATAGCGA TAAGGATCCG 
5161 CGTATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA 
5221 CCCGCCAACA CCCGCTGACG CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG 
5281 'ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA 
5341 ACGCGCGAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT 
54 01 AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 
5461 TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT 
5521 GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT 
5 581 TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT 
5641 AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
5701 CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA 
57 61 AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 





5821 CCGCATACAC TATTCTCAGA ATGACTTGGT ™™g ™CAG J£™ 
5881 TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC JJAACCA CTTTTTTGCA 
594! TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG JAGCTAACCG 
6001 CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA <~CG TGCGCAAACT 
6061 ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG ^AACGT GGATGGAGGC 
6121 ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA ™TAG TTATTGCTGA 
6181 GGATAAAGTT GCAGGACCAC TTCTGCGCTC ^CTTCCG ^TGGC GGCCAGATGG 
6241 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT ^CACT TGGATGAACG 
6301 TAAGCCCTCC CGTATCGTAG TTATCTACAC JACGGGGAGT CATTGGTAAC TGTCAGACCA 
6361 AAATAGACAG ATCGCTGAGA TAGGTGCCTC JCTG^TAAG ^TT 

6421 AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT ^TTTAA TTTCGTTCCA 
6481 GGTGAAGATC CTTTTTGATA ATCTCATGAC JAAAATCCCT SScCTT TTTTTCTGCG 
6541 CTGAGCGTCA GACCCCGTAG £JJ«^£ SSIcS GCG^GGTTT GTTTGCCGGA 

6601 SSSE £££££ aaSggcS agcagagcgc agataccaaa 

:;s ssss ^ cg™ ^ = ™cc 

6781 TACATACCTC GCTCTGCTAA JCCTCTTACC JGTG GCGCAGCGGT CGGGCTGAAC 

5=55 iiEE EE sssss sssss sssss 

6 961 ACAGCGTGAG cattgagaaa gcgccacgct cttccagggg gaaacgcctg 

7021 GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG ^CC TTTTGTGATG 
7081 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT ^GCGTCGAT ^ 
7141 CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC ^GGCCT ATTCTGTGGA 

S ====== == === 

7381 GCGTTGGCCG ATT C ATTAAT ^AGAGCTTG CAATTCGCGC J^TTCAAT JTT^^ 
7441 CATTTATCAG GGTTATTGTC JCATGAGCGG JTACATATTT ^ ^ 

S = §= EE S= SS 

™ US =S S= ssss esse ssss 

= =£ 2= SSi SEE 

is; ssxs sssss ssss ess~ .™™ -™». 

8101 GGCGTGTACG GTGGGAGGTC TAT 





Figure frl pEXPSOl: pCMV* SPORT 6 host for attB Libraries 
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Figure p£*P*5x>/ Ca>«t"4). Features of the a tt B cloning 

vector, p£5cpSC>l. Bases wfthin hatched area are 
replaced by cDNA in some LTI cDNA libraries. 

aga get cgt tta gtg aac cg|t cag ate gee tgg aga cgc cat cca 

t>ct cga gca aat cac ttg gca gtc tag egg acc tct gcg gta ggt 



cgc tgt ttt gac etc cat aga aga cac egg gac cga tec age etc 
gcg aca aaa ctg gag gta tct tct gtg gee ctg get agg teg gag 



Vrt X trri> rev j 

egg act eta gee tag gec gefe *gag egg ata aca att tea cac agg* 
gec tga gat egg ate cggTcgc etc gec tat tgt taa agt gtg tec 

■ 

- " - — *- — *- ' Jga 



aaa cag eta tga cca tta ggb ct^i ttt agg tga cac tat afgaFaca" 
ttt gtc gat act ggt aat ccb gat aaa tec act gtg ata tct \tgt 

**** % - **« **** 

"agt ttjg tac aaa aaa gca ggc ttpg tqc] cqb tec g ^pi att ccc Iggg 
atf£ atfl tfcElttt cgt: ccg afcfc atg gee I agfr ccttaal ggg c 



tea 



&Efl tfcE^ttt cgt: ccg a^ : atg gcc|ag*g cct taajggg^ccc 



:'/gf ta- c gc { ggc c gc t {ct aga gta tec 



ta t//agg/ agcV^c- teg agt gat c 



^'.c cg ccg g^:g agat^t 



cat agg 

VI Wit *AU 



c jtc ga g ggg ccfc a frg ct t a fcg cg t acc cag ctt t (ct tgt aca aag 



gag c 



t|c cpggg ttc ga^i tgc g<Ji [tgg gtc gaa aga aca t^ 



ttc 



"tgg tvrc eta tag tga gtc gta tta taa get agg cac tgg ccg teg 

acc apg gat ate act cag cat aat 
l 1 



it att cga tec gctg acc ggc age 



ttt tac aac gtc gtg act 

aaa atg y g cag cac tga ccc, ttt tga cga c 



ggg aaa act gfet age ttg gga tct ttg 



aac cct aga aac 



pEXP501 4396 bp 




61 
121 
181 
241 
301 
361 



1 CCATTCGCCA TTCAGGCTGC GCAACTGTTG GGAAGGGCGA TCGGTGCGGG CCTCTTCGCT 

attSgccag" ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 
SJcgatcca gacatgataa gatacattga tgagtttgga caaaccacaa ctagaatgca 
gtSSSSaa tgctttattt gtgaaatttg tgatgctatt gctttatttg taaccattat 
aagc^£a£t" aaacaagtta acaacaacaa ttgcattcat tttatgtttc aggttcaggg 
ggaggtgtgg gaggtttttt aaagcaagta aaacctctac aaatgtggta tggctgatta 
361 tgatcatgaa cagactgtga ggactgaggg gcctgaaatg agccttggga ctgtgaatct 
42i IaaIScaca aacaattaga atcactagct cctgtgtata atattttcat aaatcatact 
til cI^aagcaa aactctcaag cagcaagcat atgcagctag tttaacacat tatacactta 
sll aaaat^ttJt atttacctta gagctttaaa tctctgtagg tagtttgtcc aattatgtca 
601 Saccacagaa gtaaggttcc ttcacaaaga tcccaagcta gcagttttcc cagtcacgac 
sl\ gt"tgt"aa2£ gacggccagt gcctagctta taatacgact cactataggg accactttgt 
6 7 6 2 i acaaSa^c tgggtacgcg taagcttggg cccctcgagg -tcctctag -cggccgcc 

781 GACTAGTGAG CTCGTCGACG ATATCCCGGG AATTCCGGAC CGGTACCAGC CTGCTTTTTT 
III gtacaaactt gttctatagt gtcacctaaa TAGGCCTAAT GGTCATAGCT GTTTCCTGTG 
901 TGaSStGTT ATCCGCTCCG CGGCCTAGGC TAGAGTCCGG AGGCTGGATC GGTCCCGGTG 

96i tct^tatgg aggtcaaaac agcgtggatg gcgtctccag gcgatctgac ggttcactaa 
io" acgagSctg cttatataga cctcccaccg tacacgccta ccgcccattt gcgtcaatgg 
i08i SSggagttg ttacgacatt ttggaaagtc ccgttgattt tggtgccaaa acaaactccc 
ii4i attgIcotca atggggtgga gacttggaaa tccccgtgag tcaaaccgct atccacgccc 
^201 attga^gtac" tgccaaaacc gcatcaccat ggtaatagcg atgactaata cgtagatgta 

1261 CTGCCAAGTA GGAAAGTCCC ATAAGGTCAT GTACTGGGCA TAATGCCAGG CGGGCCATTT 

i32i aStcIttg" acgtcaatag ggggcgtact tggcatatga tacacttgat gtactgccaa 

Ull GTGGGCAGTT TACCGTAAAT ACTCCACCCA TTGACGTCAA TGGAAAGTCC CTATTGGCGT 

llll tactatSgga acatacgtca ttattgacgt caatgggcgg gg^ottoo ocggtcagcc 



1501 AGGCGGGCCA ESSSS GTTATGTAAC GACAXGCAXC TAATGAGTGA AAGGGCCTC3 

mi sssse i;;™ ™^ ™^ 



1621 
1681 
1741 
1801 



TAPTACGCCT ATTTTTATAG GTTAAl'tiTLK i^±^x^-l - - — 

GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTTGTTT ATTTTTCTAA ATACATTCAA 
ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAACGC 
gcgaattgca AGCTCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 

Sg^SSc" t?ccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 

GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 



2041 GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 
2?01 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
llel cttcgggaag CGTGGCGCTT TCTCAATGCT CACGCTGTAG gtatctcagt TCGGTGTAGG 
2221 tcgttcgSc caagctgggc TGTGTGCACG aaccccccgt tcagcccgac cgctgcgcct 
2281 tScStaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 

llll CAGcSSS TAACAGGATT AGC AG AG CGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 

2401 aotgSggcc taactacggc tacactagaa ggacagtatt TGGTATCTGC GCTCTGCTGA 
ItS agccaottac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 
2521 gSgSSSg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 
2581 aagItccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag 
llll SStttggt catgccataa cttcgtatag catacattat acgaagttat ggcatgagat 
2V0I tat"caaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 



61 aaagtatata tSagtaaact tggtctgaca gtxaccaatg cttaatcagt gaggcaccta 
tIScaSSat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 

CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATG AT AC CG CGAGACCCAC 



2821 

2941 gScaS'cggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 
3ooi gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 

3061 
3121 



TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG 
tgtcacgctc GTCGTTTGGT ATGGCTTCAT tcagctccgg TTCCCAACGA TCAAGGCGAG- 



,181 TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTTCGGTCCT CCGATCGTTG 

55b ESSE SSSS SSSS ssss ssss 

SSSS SSSS SEES SEES =555 SSSS 

i™ = ssss ssss ssss ssss 

SEE SSSS SSSS ssss ssss ssss 

~ ssss sss ssss ssss ssss 

Ta!i ccaScSag acctggcatg agcggataca tatttgaatg tatttagaaa aataaacaaa 
tSSgS^c Scgcacattt ccccgaaaag tgccacctga aattgtaaac gttaatattt 

EE 3SSS SSSS SSSS SSSS SEES 

iiE ssss ssss ssss ssss ssss 
™ ssss ssss ssss = ssss 

43 21 cgctggcaag tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 



43 81 CGCTACAGGG CGCGTC 



NR201 (kanR) 




BspLU11 14161 
>\flll1 4161 
Drtf I 4053 
BssS I 3988 
Hae II 3917 



Bbs I 4416 
Bs/8 I 4346 
BsaH I 4298 
Nsp 14161 



Hpal 1 
yApa I 22 

' EcoO109 



Eco57 I 3634 



Vsp I 3188 



Pvu I 2987 
Sgfl2986 

EcoN I 2898 




BspM I 620 
Bsa I 741 
BsOC I 757 
Mam I 860 
Srf I 874 
Sma I 875 



| Bst 1107 I 1211 

- BssH II 1250 
SamH I 1291 
Seal 1329 

Dsa I 1443 
/Vcol 1443 
S/yl 1443 



G$u\ 1643 

EcoRI 1744 
Kprt2l 1748 



Nru I 2645 



BsaA I 2053 





pDONR201 4470 bp (rotated to position 3516) 

L ocation_JBas^NosJ. Gptir Encoded 

attPl 



ccdB 



260 . .29 
656 . - 961 

1099.. 1184 ccdA 

1303.. 1962 CmR 

2210.. 2442 attP2 

2565.-3374 Kmr 

3495.. 4134 ori 



1 GTTAACGCTA GCA.GGA.C, CGGGCCCCAA ™- -™ ^ 
61 TGTTCGTTGC AACAAATTGA TGAGCAATGC TTTTTTATAA TTTG CATAAA 
121 GCTGAACGAG AAACGTAAAA TGATATAAAT JTCAATATAT JAAATTA CAACTACTTA 
1B1 AAACAGACTA CATAATACTG ^AAAACACAA CATATCCAGT CACTA GGTGATGCTG 
241 GATGGTATTA GTGACCTGTA ^CGACCGAC JGCCTTCCAA ™ TCGTATCCAG 
301 CCAACTTAGT CGACCGACAG CCTTCCAAAT ^TTCTCA ^ aaaaagAGGT 
361 CCTACTCGCT ATTGTCCTCA ATGCCGTATT AAAJCATAAA CGCTAGTGTC 
421 GCGAGCCTCT TTTTTGTGTG ACAAAATAAA ^ATCTACC TATTCATA 

481 ATAGTCCTGA AAATCATCTG CATCAAGAAC JATTTCACAA JJCTTATACT ^ 
541 AAGT CGTTCG GCTTCATCTG GATTTTCAGC CTCTATACTT ^AAA GACATTTATA 
601 TGTAATTTCT ACTGTATCGA CCTGCAGACT ^CTGTGTAT CTGAGATCAG 
661 TTCCCCAGAA CATCAGGTTA ^GGCGTTTT TGATGTCATT J^GC ATCATGCGCC 
721 CCACTTCTTC CCCGATAACG GAGACCGGCA JACTSGOCM ^CGG TCTGACAGCA 
781 AGCTTTCATC CCC-TATGC JC«™£ £££££ CGTGTCAATA ATATCACTCT 
841 GACGTGCACT GGCCAGGGGG J^ACCA GGTGTAAACC TTAAACTGCA 

901 GTACATCCAC AAACAGACGA TAACG °^CT ^CTTTTA CCGGGCGACC 
961 TTTCACCAGT CCCTGTTCTC GTCAGCAAAA GAGCCGTTCA ^CAA ACGGGCTTCA 
1021 TCAGCCATCC CTTCCTGATT TTCCGCTTTC CAGCGTTCGG ^GCAG TTGAGCAACT 
1081 TTCTGCATGG TTGTGCTTAC CAGACCGGAG JTATTGACAT BATATA CTCTTTTTGA 
U41 GATAGCTGTC GCTGTCAACT ^CACTGTAA TACGCTGCTT CATAGC^ 
12 01 CATACTTCGG GTATACATAT CAGTATATAT ^TTATA GATTACGCCC CGCCCTGCCA 
12 61 CGCATACTGT TATCTGGCTT ^TAGTAAGCC ^CACGC ^TTA CATCACAGAC 
1321 CTCATCGCAG TACTGTTGTA ^TTCATTAAG CATTCTGCCG f f G TATAATATTT 
1381 GGCATGATGA ACCTGAATCG CCAGCGGCAT ^AGCACCTTG ^CCT ^AT C AAAACT 

1441 GCCCATGGTG AAAACGGGGG CGAAGAAGTT GTCCATATTG ^CACGTT 
1501 GGTGAAACTC ACCCAGGGAT TGGCTGAGAC ^AAAAACATA JTCTCAATAA ^ 
1561 GAAATAGGCC AGGTTTTCAC CGTAACACGC CACATCTTGC J^ATA GCT C ATGG AA 

1621 CCGGAAATCG TCGTGGTATT CACTCCAGAG CGATGAAAAC ^TTCAG CCAT 
1681 AACGGTGTAA CAAGGGTGAA CACTATCCCA ^ATCACCAGC TCACCGTC CCGGATAAAA 
1741 ACGGAATTCC GGATGAGCAT TCATCAGGCG ^^SAATQ ^ GAACGGTCTG 
1801 CTTGTGCTTA TTTTTCTTTA CGGTCTTTAA ^CCGTA ^ATCCAG TQ 
1861 GTTATAGGTA CATTGAGCAA CTGACTGAAA TGCCTCAAAA TGTTCTT 

1921 GGATATATCA ACGGTGGTAT ^CCAGTGAT TTTTTTCTCC J TATGGTGAAA 
. 1981 TGAAAATCTC GATAACTCAA AAAATACGCC CGGTAGTGAT CT^^ ccCAGGGCTT 
' 2041 GTTGGAACCT CTTACGTGCC GATC^CGTC ^ATTCTGCG AAGTGATCTT CCGTCACAGG 
2101 CCCGGTATCA ACAGGGACAC CAGGATTTAT TTAH TTAGTCGACT ACAGGTCACT 

2161 TATTTATTCG GCGCAAAGTG CGTCGGGTGA JGCTGCCAAC ™ AGTATTATGT 
22 21 _ AATACCATCT AAGTAGTTGA TTCATAGTGA JTGGATATGT TTTACGTTTC 
2281 AGTCTGTTTT TTATGCAAAA TCTAAT™ StScTTAT CAATTTGTTG 

2341 TCGTTCAGCT TTCTTGTACA ^TTGGCAT ™TAAG CTGCAGCTCT 
2401 CAACGAACAG ^ATCA ^CAAAATAA ^ATTATT TGC^^ TCATCATQAA 
2461 GGCCCGTGTC TCAAAATCTC TGATGTTACA TG TTATGAGC CATATTCAAC 

2521 CAATAAAACT GTCTGCTTAC ATAAACAGTA JTACAAGGGG TGTTATGAG 
2581 GGGAAACGTC GAGGCCGCGA TTAAATTCCA JCATGGATGC ^GATTTA GGGAAGCCCG 

2 2 6 7 4 oi SSSS SEES SSSSS £S= lt cc ^ g™™ 





AGATGGTCAG ACTAAACTGG CTGACGGAAT TTATGCCTCT TCCGACCATC AAGCATTTTA 

2821 ?cc^actc"c ?ga?g1tgca tggttactca ccactgcgat ccccggaaaa acagcattcc 
ex Staga agaatatcct gattcaggtg aaaatattgt tgatgcgctg ocagtgttcc 
, q41 tgcgCCGGTT gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 
4 oi Sc?cgSS ggcgcaatca cgaatgaata acggtttggt toatgcgagt ^-tgatg 
3 061 acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat 
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 
3i8i agggSaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 
\\\l atcttSSIt Sctatggaac tgcctcggtg agttttctcc ttcattacag aaacggcttt 
330^ ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 

H A ATGAGTTTTT CTAATCAGAA TTGGTTAATT GGTTGTAACA CTGGCAGAGC ATTACGCTGA 
HA CTTGACGGGA cSSgSSgC TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG 
Itll AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT 

Itll aSSgcSc Sgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca 
lltl aSagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac 
366^ JSccScS StSagccgt agttaggcca ccacttcaag aactctgtag caccgcctac 
lill IScctSct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct 
3781 taccgggttg SScaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg 
llll GGGTTCGTGC acacagccca gcttggagcg aacgacctac accgaactga gatacctaca 
llol g?SgagSa tcagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt 
396 i aagcScagg" Scggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta 

Toll TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC 

loll gtcaSgSg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac GGTTCCTGGC 
till SSSSgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa 
^201 ccSattacc gctagccagg aagagtttgt agaaacgcaa aaaggccatc cgtcaggatg 
till gcS™S Sgtttgatg cctggcagtt tatggcgggc gtcctgcccg ccaccctccg 
ggccgttgc? Sacaacgtt caaatccgct cccggcggat ttgtcctact caggagagcg 

4381 ScACCGACA AACAACAGAT AAAACGAAAG GCCCAGTCTT CCGACTGAGC CTTT CGTTTT 
4441 ATTTGATGCC TGGCAGTTCC CTACTCTCGC 





BspMI 
Bsal. 
BstXI : 
Btrl. 
BsaBI. : 
Xmai.-'- 
SmalV: : 
Srfl V\ \ 



Xmnl 



BbvCI. 



Bst1107l. 
BssHII. 
BamHI. * 
Seal.. 



StyL-y 
Ncol/ 
BstDSI 



Bpml/ 
Psp1406l 

EcoR 
Acclll 




ccdB(decUh) 

4000 



(ccdA) 



BsaAl 



pDONFS02 



EcoO109l 
Apal 
PspOMI 
.Nhel 
• .Afllll 
• : Nspl 
r.Pcil 




1000 




...Eco57l 



Vspl 



-•.Sgfl 
Pvul 
EcoNI 



Earl 



Nrul 



pDONR202 4204 bp 



Location (Bas e Nos . ) 
369 . . 127 
486 . . 1059 
1228 . 
2381 
2629 
3408 



2107 
2140 
, 3288 
. 3492 



3630 . .3935 



Gene Encoded 

attPl 

ori 

KmR 

attP2 

CmR 

inactivated ccdA 
ccdB 



1 CGGCATTGAG GACAATAGCG AGTAGGCTGG ATACGACGAT TCCGTTTGAG AAGAACATTT 
6i ggaag~gctg~t CGGTCGACTA AGTTGGCAGC ATCACCCGAA GAACATTTGG AAGGCTGTCG 

121 gtSaSaca ggtcactaat accatctaag tagttgattc atagtgactg gatatgttgt 
lei gtSScagt attatgtagt ctgtttttta tgcaaaatct aatttaatat attgatattt 

III SSSSS ACGTTTCTCG TTCAGCTTTT TTGTACAAAG TTGGCATTAT AAAAAAGCAT 
201 TGCTCATCAA TTTGTTGCAA CGAACAGGTC ACTATCAGTC AAAATAAAAT CATTATTTGG 
III ggccSaSS CCATGCTAGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 
421 aagaacatot gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 
Hi Sg^Stcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 
sll aggtgIcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 
60i gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 
661 ggaagcgSg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 
nil cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 
781 ggtaaSatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 
III actStIaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 
Vol tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 
lei gSacc^cg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 
1021 ggSgtStt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 
llll cctttgISt tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 
llll SgScatga gcttgcgccg tcccgtcaag tcagcgtaat gctctgccag tgttacaagc 

llll AATTAAC C AA TTCTGATTAG AAAAACTCAT CGAGCATCAA ATGAAACTGC AATTTATTCA 
i26i ^cSga?t atcaatacca TATTTTTGAA AAAGCCGTTT CTGTAATGAA GGAGAAAACT 
caccgISgcX gttccatagg atggcaagat cctggtatcg gtctgcgatt ccgactcgtc 

"si CAACATCAAT ACAACCTATT AATTTCCCCT CGTCAAAAAT AAGGTTATCA AGTGAGAAAT 
llll ca?catgagt GACGACTGAA TCCGGTGAGA ATGGCAAAAG TTTATGCATT TCTTTCCAGA 
Itol StgScaac aggccagcca ttacgctcgt catcaaaatc actcgcatca accaaaccgt 

1S61 TATTCATTCG TGATTGCGCC TGAGCGAGAC GAAATACGCG ATCGCTGTTA AAAGGACAAT 
llll TACAAACAGG AATCGAATGC AACCGGCGCA GGAACACTGC CAGCGCATCA ACAATATTTT 
llll CAC^SaATC AGGATATTCT TCTAATACCT GGAATGCTGT TTTTCCGGGG ATCGCAGTGG 

mi tgagtaacca TGCATCATCA GGAGTACGGA TAAAATGCTT gatggtcgga agaggcataa 

llll ATTCCGTCAG CCAGTTTAGT CTGACCATCT CATCTGTAAC ATCATTGGCA ACGCTACCTT 
1861 TGCCATGTTT CAGAAACAAC TCTGGCGCAT CGGGCTTCCC ATACAAGCGA TAGATTGTCG 
llll cacctgIttg CCCGACATTA TCGCGAGCCC ATTTATACCC ATATAAATCA GCATCCATGT 
llll TGGAATTTAA TCGCGGCCTC GACGTTTCCC GTTGAATATG GCTCATAACA CCCCTTGTAT 
'2041 TACTGTTTAT GTAAGCAGAC AGTTTTATTG TTCATGATGA TATATTTTTA TCTTGTGCAA 
2101 TGTAACATCA GAGATTTTGA GACACGGGCC AGAGCTGCAG CTGGATGGCA AATAATGATT 
2161 TTATTTTGAC TGATAGTGAC CTGTTCGTTG CAACAAATTG ATAAGCAATG CTTTCTTATA 
llll ATGCCAACTT TGTACAAGAA AGCTGAACGA GAAACGTAAA ATGATATAAA TATCAATATA 
2281 TTAAATTAGA TTTTGCATAA AAAACAGACT ACATAATACT GTAAAACACA AC AT AT C C AG 
2341 TCACTATGAA TCAACTACTT AGATGGTATT AGTGACCTGT AGTCGACTAA GTTGGCAGCA 
2401 TCACCCGACG CACTTTGCGC CGAATAAATA CCTGTGACGG AAGATCACTT CGCAGAATAA 
246^ ATAAATCCTG GTGTCCCTGT TGATACCGGG AAGCCCTGGG CCAACTTTTG GCGAAAATGA 
2521 GACGTTGATC GGCACGTAAG AGGTTCCAAC TTTCACCATA ATGAAATAAG ATCACTACCG 
2 581 GGCGTATTTT TTGAGTTATC GAGATTTTCA GGAGCTAAGG AAGCTAAAAT GGAGAAAAAA 
2 641 ATCACTGGAT AT AC C AC CGT TGATATATCC CAATGGCATC GTAAAGAACA TTTTGAGGCA 
2 7 0i SSagtcag TTGCTCAATG TACCTATAAC CAGACCGTTC AGCTGGATAT TACGGCCTTT 



ss ssss Eii ii= S iiiii 

2 881 TGGGATAGTG TTCACCCTTG TTACACCGTT ^CATGAGC GCAAGATGTG 
2941 CTCTGGAGTG AAT AC C ACG A CGATTTCCGG CAGTTTCTAC £ATA 

3001 GCGTGTTACG GTGAAAACCT G^CTATTTC ^CTAAAGGGT CAATATGGAC 

3061 ™ C co SSSS SSSSS SSSSS cSSaggcga CAAGGTGCTG 

3121 AACTTCTTCG CCCCCGTTTT CAL ^^ PTPTPTrATG GCTTCCATGT CGGCAGAATG 

Si 5552 SEE S5SS 53=5 !_« — « 

us ssss =S SSI iiE ssss see 

3421 TTACAGTGAC AGTTGACAGC GACAGCTATC CGTCTGCGTG CCGAACGCTG 

3481 CTCCGGTCTG GTAAGCACAA CCATGCAGAA TGAAGCCCGT CGTCTGCG^ c 
3541 GAAAGCGGAA AATCAGGAAG GGATGGCTGA JGTCGCCCGG JTTATTGAAA 

- = ~ =i i= Ssss ssss 

3721 GGATGGTGAT CCCCCTGGCC J^^C GGCGCATGAT GACCACCGAT ATGGCCAGTG 
3781 AC C CGGTGGT GCATATCGGG ^GAAAGCl ptGATCTCAG CCACCGCGAA AATGACATCA 

"E SSSSS TAACCTGATG SSSSS AGGCTCCCTT ATACACAGCC 

3901 AAAACGCCAT TAACC1fc.AH* ^ riaaCTTTAT CACGTTTAGT AAGTATAGAG 

3961 AGTCTGCAGG T CG ATACAGT AGAA^TTACA GAAACTTTAT J£ TTGTGAAATT 
40 21 GCTGAAAATC CAGATGAAGC CGAACGACTT ^AAGAG^ QGTAGATGTT 

till cacacaaIIa agaggctcgc ACCTCTTTTT CTTATTTCTT TT TATGA TTT 
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„t„= \ n pne Encoded 
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CmR 
attP2 



47 . .131 
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1509.- 2082 orl 
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3 812.. 4117 ccdB 



CGGCTTCATT CTGCATGGTT GTGCTTACCA GACCGGAGAT 
1 GCGTTCGGCA CGCAGACGAC GGGCTTCA 1 1 ~ c tGTCAACTGT CACTGTAATA 

61 ATTGACATCA TATATGCCTT GAGCAACTGA ^GCTGTCGC ^TCAA GTATATATTC 
121 CGCTGCTTCA TAGCACACCT CTTTTTGACA TC TGGCTTTT AGTAAGCCGG 

131 TTATACCGCA AAAATCAGCG C^TACG C^CAGTA CTGTTGTAAT TCATTAAGCA 
241 ATCCACGCGT TTACGCCCCG CCCTGCCACU CTGAATCGCC AGCGGCATCA 

3 01 TTCTGCCGAC ATGGAAGCCA TCACAGACGG CATGATGAAC AAGAAGTTGT 
361 GCACCTTGTC GCCTTGCGTA TAATATTTGC CCATGGTGAA JACG^^ 
421 CCATATTGGC CACGTTTAAA TCAAAACTGG ™£££*G GTTTTCACCG TAACACGCCA 
481 AAAACATATT CTCAATAAAC CCTTTAGGGA AATAGGCCAG ^ CTCCAGAGCG 

541 CATCTTGCGA ATATATGTGT AGAAACTGCC JGAAAT AGGGTGAACA CT AT C C CAT A 

6 01 ATGAAAACGT TTCAGTTTGC *CATGGAAAA CGGTGTAACA A ATCAGGCGGG 
66 1 TCACCAGCTC ACCGTCTTTC ATTGCCATAC 8^.^,, GTCTTTAAAA 
721 CAAGAATGTG AATAAAGGCC GGATAAAACT TGTGCTTATT ^ GACTGAAATG 

781 AGGCCGTAAT ATC C AGCTGA ACGGTCTGGT JATAGGT GGTGGTATAT CCAGTGATTT 

841 CCTCAAAATG TTCTTTACGA TGCCATTGGG JTATATC AATACGCCCG 

8 90 i ttttctccat tttagcttcc ttagctcctg aaaatctcga taactC ccga tcaacgtctc 
96i gtagtgatct tatttcatta tggtgaaagt tggaacc AGGGACACCA ggatttattt 
1021 attttcgcca aaagttggcc cagggcttcc cggt GCAAAGTGCG tcgggtgatg 

1081 attctgcgaa gtgatcttcc gtcacaggta Jttatt gtagttgatt catagtgact 
ii4i ctgccaactt agtcgactac aggtcactaa taccatct tc taatttaata 

i20i ggatatgttg tgttttacag JATTJTGTAG tctgtttttt GTTGGC atta 
i26i tattgatatt tatatcjlttt tacgtttctc ^tcagcT gt cactatcagt caaaataaaa 

1321 TAAGAAAGCA TTGCTTATCA ATTTGTTGCA ^ caqaATCAGG GGATAACGCA 

1381 TCATTATTTG CCATCCAGCT AGCGGTAATA CGGTTATCC GG CCGCGTTG 
14 41 GGAAAGAACA TGTGAGCAAA AGGCCAGCAA ^ ACAAAAATCG ACGCTCAAGT 

1501 CTGGCGTTTT TCCATAGGCT C08COCCCCT JJCGAGCATC TGQAAGCTCC 
1561 CAGAGGTGGC GAAACCCGAC AJGACTATAA A ACC TGTCCGC CTTTCTCCCT 

1621 CTCGTGCGCT CTCCTGTTCC GA CCCTGCCG J™ ATC TCAGTTC GGTGTAGGTC 

1681 TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT Q CTQCGCCTTA 

1741 GTTCGCTCCA AGCTGGGCTG TGTGCA.CGAA CCCCCCGTTC ACTGGCAGCA 
1801 TCCGGTAACT ATCGTCTTGA STCCAACCCG STAAGACACG GTTCTTGAAG 
1861 GCCACTGGTA ACAGGATTAG ^^CGAGG ^TGTA QTATCTGCGC TCTGCTGAAG 

1921 TGGTGGC CT A ACTACGGCTA CACTAGAAGA ^AGTATTTG c CACCGCTGGT 

19 81 CCAGTTACCT T CGGAAAAAG AGTTGGTAGC ™ATCCG ATCTCAAGAA 
' 2041 AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGC^ AGG -pxAAGGG 
2101 GATCCTTTGA TCTTTTCTAC G ^TCTGAC AATGCTCTGC CAGTGTTACA 

2161 ATTTTGGTCA TGAGCTTGCG CCGTCCCGTC JAGTCAGCGT AA^ ^ 
2221,ACCAATTAAC CAATTCTGAT TAGAAAAACT CATCGAGCAT ^ GAAGGAGAA A 

2281 TCATATCAGG ATTATCAATA CCATATTTTT G ^^ G TCGGTCTGCG ATTCCGACTC 

2341 ACTCACCGAG GCAGTTCCAT AGGATGGCAA ^CTGGTA ^ TCAAGTGAGA 

2401 GTCCAACATC AATACAACCT ATTAATTTCC ^^CGTCAAA ATTTCTTTCC 
2461 AATCACCATG -TGACGACT ^f^O AGAATGGCAA -G^^ TCAACCAAAC 

llll CGTTATTCAT ~ GACGAAATAC — , 

2 2 6 7 4 oi USSSS tgt™ ggg™. 





2761 TGGTGAGTAA CCATGCATCA TCAGGACTAC GGA.AAAATG CTTGATGG.C G^AAGAG^CA 
2821 TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT ^A CGATAGATTG 
288 1 CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT TC AGC ATC CA 

2 941 TCGCACCTGA TTGCCCGACA TTATCGCGAG ^CATTTATA AC ACCCCTTG 
3001 TGTTGGAATT TAATCGCGGC CTCGACGTTT CCCGTTGAAT ttatcttqtg 
3061 TATTACTGTT TATGTAAGCA GACAGTTTTA TTGTTCA1 GGATCTCGGG 
3121 CAATGTAACA TCAGAGATTT TGAGACACGG ^AGAGCTG CAGCT 

3181 CCCCAAATAA TGATTTTATT TTGACTGATA ^GACCTGTl GTAAAATGAT 
324 1 CAATGCTTTT TTATAATGCC ^TTTGTAC £J£^ AGACTACATA ATACTGTAAA 
3301 ATAAATATCA ATATATTAAA TTAGATTTTG JATAAAAA* C CTGTAGTCG 
3361 ACACAACATA TCCAGTCACT ATGAATCAAC JACTTAGATG ° CGACAGCCTT 
3421 ACCGACAGCC TTCCAAATGT JCTTCGGGTG ATGCTGCCAA C TCCTCAATGC 
3481 CCAAATGTTC TTCTCAAACG G^GTCGT JTCCMOT TGTGTGACAA 
3541 CGTATTAAAT CATAAAAAGA ^TAAGAAAA J^GTGCGA TCCTGAAAAT CATCTGCATC 
3601 AATAAAAACA TCTACCTATT CATATACGCT JGTGTCATAG ^ CATCTGGATT 

3661 AAGAACAATT TCACAACTCT TATACTTTTC JCTTACAAGT T ATCGAC CTG 

3721 TTCAGCCTCT ATACTTACTA JACGTGATAA AGTTTCTGTA f ^ 
37 81 C AGACTGGC T GTGTATAAGG GAGCCTGACA ^TATAl ATAACGGAGA 
3841 CGTTTTTGAT GTCATTTTCG CGGTGGCTGA GJTCAGCCAC TTCT ATATGCACCA 
3901 CCGGCACACT GGCCATATCG GTGGTCATCA JGCGCCAGCT AGGGGGATCA 
3961 C CGGGT AAAG TTCACGGGAG ^TTTATCTG J^AGACG ^CCACAAAC AGACGATAAC 
4021 CCATCCGTCG CCCGGGCGTG TCAATAATAT JACTCTGTAC GTTCTCGTCA 
4081 GGCTCTCTCT TTTATAGGTG ^AAACCTTAA JCTGCATTTC JCCA CTGATTTTCC 
4141 GCAAAAGAGC CGTTCATTTC AATAAAC CGG GCGACCTCAG tLAl 
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- = ~ ilii iiii = = 
Si =i liE iEE sss = ssss 

HI iiE == EE ™ 1= ii= 

s sssss ees = i ~ sssss S5SS 

™ ™I Ess EE ™* sss ~? 

661 J^SS SSSS SaSgcgat ccgcgggaaa acagcattcc aggtattaga 

III =?S ga?=g aaaa^ot ™cc 

„i gcattcgatt ^tgtttgta ™SSagt gattttgatg acgagcgtaa 

IS fcSA = ACG ^CA, TC,™ 

- =Ii iiii iiii sssss ssss 

1141 CCTATGGAAC TGCCTCGGTG AGTailui^ TTGATGCTCG ATGAGTTTTT 

si ssss = ~ Eiic sees ssss 
su sees = ssss iii h S5SSS ssss 

1441 AACAAAAAAA CCACCGCTAC ^CGGTGGT TTCTAGTGTA 
1501 TTTTCCGAAG ^AACTGGCT TCAGCAGAGC ^AGATACCA TCGCTCTGCT 
1561 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTCTTACCG GGTTGGACTC 

1621 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTC CGTGCACACA 
1681 AAGACGATAG TTACCGGATA AGGCGCAGCG J^GGGCTGA J AGCTATGAGA 

1741 XgSccacg SESSES ggISSaggc SSSSSS CCGGTAAGCG GCAGGGTCGG 

1801 AAGCGCCACG CTTCCCGAA^ _, IiIirrrr TGGTATCTTT ATAGTCCTGT 

1861 AACAGGAGAG CGCACGAGGG JGCTTCCAGG ^^GCC TGGTATCT^ 

1921 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG JTTTTTGTG GCTGGCCTTT 

l 98 r CCTATGGAAA AACGCCAGCA JCGCGGCCTT StCTGTG GATAACCGTA TTACCGCTAG 

2 041 TGCTCACATG TTCTTTCCTG CGTTATCCCC ^11^- QCAACAAATT 

2101 SSSSS SSSS ™™ SSgIaCG AGAAACGTAA 

2161 GATAAGCAAT GCTTTTTTA 1 nTTTTGCATA AAAAACAGAC TACATAATAC 

22 21 AATGATATAA ATATCAATAT JTTAAATTAG JTTTTGCATA t*™^ TAQTGACCTG 
228 1 TGTAAAACAC AACATATCCA JTCACTATGA ™ CCGAATAAAT ACCTGTGACG 

2341 TAGTCGACTA AGTTGGCAGC ATCACCCGAC ^ACTl TTGAT AC CGG GAAGCCCTGG 

2401 GAAGATCACT TCGCAGAATA AATAAATCCT JGTGTCCCTG JTGAT ATACTTTTCT 
lie, GCCAACTTTT -CGAAAATG ^GTTGAT CGGCACJTTT -CAACT^ 
2521 CTTACAAGTC GTTCGGCTTC ATCTQGATTT ^ TGTATAACGG AGCCTGACAT 

2581 GTTTCTGTAA TTTCTACTGT ATCGACCTGC AGACTGGCTG GGTGGCTGAG 
2641 'TTATATTCCC C AG AAC AT C A JGTTAATGGC ^TTTTGATG ^ATT TGGT CAT CAT 

2701 ATCAGCCACT TCTTCCCCGA ^AACGGAGAC CGGCACACTG ^C^^ CTTTATCTGA 
2761 GCGCCAGCTT ^CATCCCCGA ™TGCACCAC CGGG CCGGGCGTGT CAATAATATC 

2 821 CAGCAGACGT GCACTGGCCA GGGGGATCAC CATCC£1 TTATAGGTGT AAACCTTAAA 

2881 ACTCTGTACA -CACAAACA --ATAACG OCTCTC-TT -ATAGGT^ ^ 

XE ££S£ Tgatt-cg ™gcg ™cgc agacgacggg 

3 3 °i 6 2 i = SESSS SacScac =cgc tgcttcatag cacacctctt- 





3181 TTTGACATAC TTCGGGTATA CATATCAGTA TATATTCTTA TACCGCAAAA ATCAGCGCGC 
\Zl IIIScScAT ACTGTTATCT GGCTTTTAGT AAGCCGGATC CACGCGTTTA CGCCCCGCCC 
3301 ^caSSt cgcagtactg TTGTAATTCA TTAAGCATTC TGCCGACATG GAAGCCATCA 
cagacgScS gaSaacctg AATCGCCAGC GGCATCAGCA CCTTGTCGCC TTGCGTATAA 
lltl ?a??SccS SSgISaac gggggcgaag aagttgtcca tattggccac gtttaaatca 
\Al ISSggtga aactcaccca gggattggct gagacgaaaa acatattctc aataaaccct 
Mil a^gccaggtt ttcaccgtaa cacgccacat cttgcgaata tatgtgtaga 

HcJgcSgI aScgtcgtg gtattcactc cagagcgatg ----- 
3661 tggaaaacgg tgtaacaagg gtgaacacta tcccatatca ccagctcacc gtctttcatt 
\¥>\ tSa^gga attccggatg agcattcatc aggcgggcaa gaatgtgaat aaaggccgga 
3781 ?aaa!c"SS SSSSttt ctttacggtc tttaaaaagg ccgtaatatc cagctgaacg 

llll GTCTGGTTAT AGGTACATTG AGCAACTGAC TGAAATGCCT CAAAATGTTC TTTACGATGC 

llol StSggSI ?Scaacggt ggtatatcca gtgatttttt tctccatttt agcttcctta 
llll GCTCCTGAAA atctcgataa ctcaaaaaat acgcccggta gtgatcttat ttcattatgg 
llll ?gaSS^ aIcctcttac tgttcttgat gcagatgatt ttcaggacta tgacactagc 

40 81 ATATATGAAT AGGTAGATGT TTTTATTTTG TCACACAAAA AAGAGGCTCG CACCTCTTTT 
4141 TCTTATTTCT TTTTATGATT TAATA 



Figure 53A; pDONR205 (tetR) 




P DONR205 4939 bp 



GGCATCAGCACCTTGTCGCCTTGCGTATAATATTTGCCCATGGTGAAAACGGGGGCGAAG 
AAGTTGTCCATATTGGCCACGTTTAAATCAAAACTGGTGAAACTCACCCAGGGATTGGCT 
GAGACGAAAAACATATTCTCAATAAACCCTTTAGGGAAATAGGCCAGGTTTTCACCGTAA 
CACGCCACATCTTGCGAATATATGTGTAGAAACTGCCGGAAATCGTCGTGGTATTCACTC 
CAGAGCGATGAAAACGTTTCAGTTTGCTCATGGAAAACGGTGTAACAAGGGTGAACACTA 
TCCCATATCACCAGCTCACCGTCTTTCATTGCCATACGGAATTCCGGATGAGCATTCATC 
AGGCGGGCAAGAATGTGAATAAAGGCCGGATAAAACTTGTGCTTATTTTTCTTTACGGTC 
TTTAAAAAGGCCGTAATATCCAGCTGAACGGTCTGGTTATAGGTACATTGAGCAACTGAC 
TGAAATGCCTCAAAATGTTCTTTACGATGCCATTGGGATATATCAACGGTGGTATATCCA 
GTGATTTTTTTCTCCATTTTAGCTTCCTTAGCTCCTGAAAATCTCGATAACTCAAAAAAT 
ACGCCCGGTAGTGATCTTATTTCATTATGGTGAAAGTTGGAACCTCTTACGTGCCGATCA 
ACGTCTCATTTTCGCCAAAAGTTGGCCCAGGGCTTCCCGGTATCAACAGGGACACCAGGA 
TTTATTTATTCTGCGAAGTGATCTTCCGTCACAGGTATTTATTCGGCGCAAAGTGCGTCG 
GGTGATGCTGCCAACTTAGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGATTCAT 
AGTGACTGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAA 
TTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTCTTGTACAAAGTT 
GGCATTATAAGAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAA 
AATAAAATCATTATTTGCCATCCAGCTGCAGCTCTGGCCCGTGTCTCAAAATCTCTGATG 
TTACATTGCACAAGATAAAAATATATCATCATGAATTCTCATGTTTGACAGCTTATCATC 
GATAAGCTTTAATGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGT 
ATGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGATGCTGTAGGC 
ATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGC 
ATCGCCAGTCACTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGCA 
CCCGTTCTCGGAGCACTGTCCGACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTA 
CTTGGAGCCACTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCCTCTAC 
GCCGGACGCATCGTGGCCGGCATCACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATC 
GCCGACATCACCGATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGTTTC 
GGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCTCCTTGCAT 
GCACCATTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTACTACTGGGCTGCTTCCTA 
ATGCAGGAGTCGCATAAGGGAGAGCGTCGACCGATGCCCTTGAGAGCCTTCAACCCAGTC 
AGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTT 
ATCATGCAACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGC 
TTTCGCTGGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCC 
CTCGCTCAAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATT 
ATCGCCGGCATGGCGGCCGACGCGCTGGGCTACGTCTTGCTGGCGTTCGCGACGCGAGGC 
TGGATGGCCTTCCCCATTATGATTCTTCTCGCTTCCGGCGGCATCGGGATGCCCGCGTTG 
CAGGCCATGCTGTCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGATCGCTC 
GCGGCTCTTACCAGCCTAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTTATGCC 
GCCTCGGCGAGCACATGGAACGGGTTGGCATGGATTGTAGGCGCCGCCCTATACCTTGTC 
TGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGAAGCC 
GGCGGCACCTCGCTAACGGATTCACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGA 
' GAACTGTGAATGCGCAAACCAACCCTTGGCAGAACATATCCATCGCATGACCAAAATCCC 
TTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTC 
TTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACC 
AGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTT 
CAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT 
CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGC 
TGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAA 
GGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGAC 
CTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGG 
GAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGA 
GCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACT 
TGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA- 



GCTAGCATGGATCTCGGGCCCCAAATAATGAT 



_ ^TTTTATTTTGACTGATAGTGACCTGTTCG 
rTTTTTATAATGCCAACTTTGTACAAAAAAGCTGAA 



"TCAAACGGAATCGTCGTATCCAGCCTACT 
rATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAAGAAAAAGAGGTGCGAGC 



cgct. 

CTCT 



TTTTTGTGTGACAAAATAAAAACATCTA 



.CCTATTCATATACGCTAGTGTCATAGTC 



CTG. 



TTCG 1 



lA AAATCATCTGCATCAAGAACAATTTCA 
,GCTTCATCTGGATTTTCAGCCTCTATA 



.CAACTCTTATACTTTTCTCTTACAAGTCG 



.CTTACTAAACGTGATAAAGTTTCTGTAAT 
CATCCCCGATATGCACCACCGGGTAAAGTTCA 

CAGTCCCTGTTCTCGTCAGCAAAAGAGCCGT^^^^^^^^^^^^^^^^^^^^^^^^^^ 
ATGGTTGTGCTTACCAGACCGGAGATATTGACAT 



CGGGAGACTTTATCTGACAGCAGACGTG 
CCGGGCGTGTCAATAATATCACTCTGTACAT 



CCAC C^S?^CG^AGCAAAAGAGC^ 

ATGAACCTGAATCGCCAGC 



Figure 5^ pDO NR2^en^ 




BSSSI 
Drdl 



Tth111l 



Avail 
DraMl 



Nhel ... 
: Apal 
.Banll 
UEco0109I 
j^PspOMI 



\ ctnoi 




l-,. Btrl 
Srfl 
/ Smal 
Xmal 

BbvCI 



Bst1107l 
BssHII 
BamHI 
Mlul 
Seal 



Sspl 
Ncol 



\ Bpml 
Psp1406l 
Apol 
EcoRI 
Acclll 



# 




pDONR206 4415 bp 



CGGCATTGAGGACAATAGCGAGTAGGCTGGATACGACGATTCCGTTTGAGAAGAACATTT 
GGAAGGCTGTCGGTCGACTACAGGTCACTAATACCATCTAAGTAGTTGAATCATAGTGAC 
TGGATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAAT 
ATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTGTACAAAGTTGGCATT 
ATAAAAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAAAATAAA 
ATCATTATTTGGGGCCCGAGATCCATGCTAGCGGTAATACGGTTATCCACAGAATCAGGG 
GATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 
GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGA 
CGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 
GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 
TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCG 
GTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 
TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCA 
CTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 
TTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCT 
CTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACC 
ACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA 
TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCA 
CGTTAAGGGATTTTGGTCATGNCGCCGTCCCGTCAAGTCAGCGTAATGCTCTGCCAGTGT 
TACAACCAATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAATGAAACTGCAAT 
TTATTCATATCAGGATTATCAATACCATATTTTTGAAAAAGCCGTTTCTGTAATGAAGGA 
GAAAACTCACCGAGGCAGTTCCATAGGATGGCAAGATCCTGGTATCGGTCTGCGATTCCG 
ACTCGTCCAACATCAATACAACCTATTAGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGC 
AGATCCGTGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGC 
GTGGAGACCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTG 
CTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTTG 
ACATAAGCCTGTTCGGTTCGTAAACTGTAATGCAAGTAGCGTATGCGCTCACGCAACTGG 
TCCAGAACCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGT 
TATGACTGTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCC 
GTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGCAACGATGTTAC 
GCAGCAGGGCAGTCGCCCTAAAACAAAGTTAGGTGGCTCAAGTATGGGCATCATTCGCAC 
ATGTAGGCTCGGCCCTGACCAAGTCAAATCCATGCGGGCTGCTCTTGATCTTTTCGGTCG 
TGAGTTCGGAGACGTAGCCACCTACTCCCAACATCAGCCGGACTCCGATTACCTCGGGAA 
CTTGCTCCGTAGTAAGACATTCATCGCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGG 
CGCTCTCGCGGCTTACGTTCTGCCCAGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTA 
TGATCTCGCAGTCTCCGGCGAGCACCGGAGGCAGGGCATTGCCACCGCGCTCATCAATCT 
CCTCAAGCATGAGGCCAACGCGCTTGGTGCTTATGTGATCTACGTGCAAGCAGATTACGG 
TGACGATCCCGCAGTGGCTCTCTATACAAAGTTGGGCATACGGGAAGAAGTGATGCACTT 
TGATATCGACCCAAGTACCGCCACCTAACAATTCGTTCAAGCCGAGATCGGCTTCCCGGC 
CTAATTTCCCCTCGTCAAAAATAAGGTTATCAAGTGAGAAATCACCATGAGTGACGACTG 
AATCCGGTGAGAATGGCAAAAGCGTATGCATTTCTTTCCAGACTTGTTCAACAGGCCAGC 
' ' CATTACGCTCGTCATCAAAATCACTCGCATCAACCAAACCGTTATTCATTCGTGATTGCG 
CCTGAGCGAGACGAAATACGCGATCGCTGTTAAAAGGACAATTACAAACAGGAATCGAAT 
GCAACCGGCGCAGGAACACTGCCAGCGCATCAACAATATTTTCACCTGAATCAGGATATT 
CT;TCTAATACCTGGAATGCTGTTTTCCCGCGGATCGCAGTGGTGAGTAACCATGCATCAT 
CAGGAGTACGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTA 
GTCTGACCATCTCATCTGTAACATCATTGGCAACGCTACCTTTGCCATGTTTCAGAAACA 
ACTCTGGCGCATCGGGCTTCCCATACAATCGAAAGATTGTCGCACCTGATTGCCCGACAT 
TATCGCGAGCCCATTTATACCCATATAAATCAGCATCCATGTTGGAATTTAATCGCGGCC 
TCCAGCAAGACGTTTCCCGTTGAATATGGCTCATAACACCCCTTGTATTACTGTTTATGT 
AAGCAGACAGTTTTATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGA 
GATTTTGAGACACGGGCCCNGCGCACTGCAGCTGGATCGGCAAATAATGATTTTATTTTG 
ACTGATAGTGACCTGTTCGTTGCAACAAATTGATAAGCAATGCTTTTTTATAATGCCAAC 



TTTGTACAAGAAAGCTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTA 

GATTTTGCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTATG 

ATTCAACTACTTAGATGGTATTAGTGACCTGTAGTCGACTAAGTTGGCAGCATCACCCGA 

CGCACTTTGCGCCGAATAAATACCTGTGACGGAAGATCACTTCGCAGAATAAATAAATCC 

TGGTGTCCCTGTTGATACCGGGAAGCCCTGGGCCAACTTTTGGCGAAAATGAGACGTTGA 

TCGGCACGTAAGAGGTTCCAACTTTCACCATAATGAAATAAGATCACTACCGGGCGTATT 

TTTTGAGTTATCGAGATTTTCAGGAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTGG 

ATATACCACCGTTGATATATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTC 

AGTTGCTCAATGTACCTATAACCAGACCGTTCAGCTGGATATTACGGCCTTTTTAAAGAC 

CGTAAAGAAAAATAAGCACAAGTTTTATCCGGCCTTTATTCACATTCTTGCCCGCCTGAT 

GAATGCTCATCCGGAATTCCGTATGGCAATGAAAGACGGTGAGCTGGTGATATGGGATAG 

TGTTCACCCTTGTTACACCGTTTTCCATGAGCAAACTGAAACGTTTTCATCGCTCTGGAG 

TGAATACCACGACGATTTCCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTA 

CGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCGTCTCAGC 

CAATCCCTGGGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAATATGGACAACTTCTT 

CGCCCCCGTTTTCACCATGGGCAAATATTATACGCAAGGCGACAAGGTGCTGATGCCGCT 

GGCGATTCAGGTTCATCATGCCGTCTGTGATGGCTTCCATGTCGGCAGAATGCTTAATGA 

ATTACAACAGTACTGCGATGAGTGGCAGGGCGGGGCGTAAACGCGTGGATCCGGCTTACT 

AAAAGCCAGATAACAGTATGCGTATTTGCGCGCTGATTTTTGCGGTATAAGAATATATAC 

TGATATGTATACCCGAAGTATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTG 

ACAGTTGACAGCGACAGCTATCAGTTGCTCAAGGCATATATGATGTCAATATCTCCGGTC 

TGGTAAGCACAACCATGCAGAATGAAGCCCGTCGTCTGCGTGCCGAACGCTGGAAAGCGG 

AAAATCAGGAAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTTGCTG 

ACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGT 

TATCGTCTGTTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGACGGATGGTG 

ATCCCCCTGGCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTGAACTTTACCCGGTG 

GTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGTGCCGGTC 

TCCGTTATCGGGGAAGAAGTGGCTGATCTCAGCCACCGCGAAAATGACATCAAAAACGCC 

ATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACAGCCAGTCTGCA 

GGTCGATACAGTAGAAATTACAGAAACTTTATCACGTTTAGTAAGTATAGAGGCTGAAAA 

TCCAGATGAAGCCGAACGACTTGTAAGAGAAAAGTATAAGAGTTGTGAAATTGTTCTTGA 

TGCAGATGATTTTCAGGACTATGACACTAGCATATATGAATAGGTAGATGTTTTTATTTT 

GTCACACAAAAAAGAGGCTCGCACCTCTTTTTCTTATTTCTTTTTATGATTTAATA 



/ 



Figure '.53" 



&rfyy {j&Jfcl) Clone of CAT Subcloned into 

i>]>esr2_ 



1021 



egg ata aca att tea cac agg aaa cag acc | atg" tcjf tifc ?2S cat cac cat: 
gee tat tgt taa agt gtg tec ttt gtc tgg tac age atg atg gta gtg gta 



fctBi- 




1072 



Hx\ Hi> fhs <b(u He T>v~ 5<ar Lgu W Us M* Pho GU< As*, L^r 

cac cat cac gg£ ate jaca agt tgg tac aaa aia gca ggc'tfct gaa aac ctg 
gtg gta gtg ccg tag ]t:qt tea aac atg tttlttt cgt ccg afr a ctt ttg gac 



1123 



^GU Dv Met G«m U/s U/3 
l ggd acc atg gag aaa aaa 



tat ttt at* ggi acc atg ga^r aia aia ate ac"t ggk tit acc acc gtt gSft 
ata aaa gtt cct tgg tac etc ttt ttt tag tga cct ata tgg tgg caa eta 



^ 0»k^ Tkr V/a(^ Av> 



5029J>put(M 



MluU3 

BclU7 

BuHI.194 

,Bspl20l.220 

:Ap*U20 



4*153*1107.. 
46l4.Nael. 
4614.NtoMl\ 

450SX>ram.. 



,B$pML&65 
RsrfL993 



4246.Sapl.. 
4124.BspLUU.. 



3710jMwN1 




..BspEUM7 



BiH.1616 



^Kpol.1918 
Wsp7lS,1918 
HitxflIU924 



3 231 AMI 



3011J=spI 

2S64JMU 



1843 

N<JcI 

StuI 

Sail 

S*cl 

Ecll 36 

Spcl 

Nod 

Xraam 

BamHI 

1900 





o 

U1 

4" 
01 




X 



4 2 3 

attB (25 tp) 



TP 



yj 

O 

HJ 

o 
o 



Integration 
(3nt, HF) 



1 Excision 
Unt,Xis,IHF> 



E. coli Genccne attL 



attR 



wn — •//- 



4 2 5 



Lambda Genome 



1 2 3 








6^ 



MLu I (reading frame A) 
Bgl II (reading frame B) 
O Xba I (reading frame C) 

i 
Ui 





,Xbal,257 
:BamHI # 263 
^EcoRV.278 




EcoRI,2007 




III 



3307 .Gsul 



2590 r Sspl 



.BamH!,263 
;\EcqRV,278 
.Notl,410 
,BamHI,481 




EC0Rl,731 
,.Gsut,832 



.Ball,996 
/Ncol.1032 
^Sspl.1041 



\Bglll.117B 
BamHI,1184 



Ball, 1625 
Ball.1721 



rEcoRV f 199l 
EcoRI,2009 



Xbal,257 
:,BamHI,263 
*.EcoRV,278 
.Not!,411 
' ,BamHl,482 




^coRl.732 
_.Gsul,833 



,8811,997 
./Nool.1033 
■:.SspM042 



;\Xbal.H79 
\BamHl,1185 
Bs$HI1,1226 



BalMe26 
Ball,1722 



2592, Sspl 



\ EcoRV p 1993 
EcoRI.2011 



■for Clci^ ~&LCo^.<fe6»«- 



\ ► 



-I ► 



foible- ^c^ecs 



Sca\ 3846 
Pvu\ 3735 
Psfl 3609 

Ppa\ 3435 



EcoRI 4361 C/al 23 

\ / H/ndlll 29 
Aafll 4286V/ EcoRV 185 

A/hel 229 

BamHI 375 



Sspl 4170 




Spftl 562 
Sa/I 651 




pBR322 
4363bp 



Eag\ 939 
Nrul 972 
SspMI 1063 



AfhU 2475 
/Vdel 2297 




Bsm\ 1353 
Sfyl 1369 
Aval 1425 

Sa/I 1444 
BspMII 1664 

Pvull 2066 



TYM 111 2219 



Results of Cloning 
tet and amp PCR Products 
by Recombination 



PCR Product ! No. Colonies 
Used in GCS Obtained 
Reactions I a°o «i P i»ted) 



tet 
attB-tet 
attB+4G-tet 



amp 



attB-amp 



(Pos* Control) 



6, 10 



9,6 



824, 1064 
703 



IS, 22 



attB+4G-amp 1 3020, 3540 



320, 394 



Form of 

DNA 
Analyzed 

SC 



SC 



SC 
Aval+Bam 
SC 



SC 



Colonies 
Obtained of 
Predicted 
Size 
Oof 8 



lof8 



7 of 7 
7 of 7 
Oof 8 



SC 
PstI 



3 of 8 



8 of 8 
8 of 8 




0 20 40 60 80 100 120 140 160 
attB PGR concentration (fmole) 





Efficiency of clonirtg of 256 bp PGR 
Into a l>^^t^] vector 



1.40% 

1.20% 
1 .00% 

0.80% 4; 

0.60% -t 

0.40% 

0.20% 

0,00% 

0 




20 40 60 80 100 120 140 160 
attB PCR concentration (fmole) 



# 



Titration of 1 kb PGR Irv 3 ^ 
Reaction 




aire PCft concentration (frnote) 



Titration of 1 kb PCR In 
Reaction 




Efficiency of cloning of 1 kb PCR 
into a 0e$wbitfvi vector 



0.35% 
0.30% 
^ 0.25% 
£ 0.20% - - 
-S 0.15% 
V 0.10% 

0.05% 

0.00% 




HO 40 60 80 100 120 140 160 
attB PCR concentration (fmolo) 



Titration of 1.4 kb PCR in L ? 
Reaction 




attB PCR concentration (fmole) 



Titration of 1-4 kb PCR in £ * p 
Reaction 




attB PCR (ng) 



Eff iciency df cloning of 1-4kb PCR 
into a .>^s4+w(7^ Vector 



0.40% 
0,35% 

^ o.ao% - 

g 0.25% - 
| 0,20% 
£ 0.15% 
w 0.10% 
0.05% 
0.00% 



Z *V 



~'r »N»«>« *<■■» '— " *■ ' ' 




0 20 40 60 80 100 120 140 160 
attB PGR concentration (tmolo) 



Titration of 3.4 kb PCR in 
Reaction 

10000 T-™ = — r 




0 20 40 60 BO 100 120 140 160 
atlB PCR concentration (fmols) 



Titration of 3,4 kb PCR in B *> 
Reaction 




100 200 300 400 

flttB PCR (ng) 



c 
t> 

s 




Efficiency of cloning of 3.4 kb PCR 
into a ^stotf^' vector 




0 20 40 60 80 100 120 140 160 
attB PCR concentration (frnolo) 





Titration of 4,6 kb PGR in 13 * 
Reaction 



*g 10000 



TO 

E 



is 



o 
o 



1000 



100 - 



10 - 




20 40 60 80 100 120 140 160 

ottB PCR concentration (fmote) 



Titration of 4.6 kb PCR In h 
Reaction 




attB PCR (ng) 



Efficiency of cloning of 4.6 kb PCR 
into a <\pt^n^~ vector 



a. 35% 




o 



20 40 60 80 100 120 140 160 
attB PCR concentration (fmote) 




t 13 



%*4 



r**s 



Titration of 10.1 kb PCR in J** 
Reaction 



18 -i 
16 - 










14 ■ 










12 - 










10 - 


"V '.im 








8 - 










6 - 




t » • 






4 - 








1. "IZ-'i **. * 


Zi 




! * *•!• .*' *■ 






0 J 











0 200 400 600 

attB PCR (ng) 



800 1000 




?5 



10.1 kb PCR DNA Titration in Bx ^ Reaction 




1» «r to 



y y il li 



& r !** ^ ^ 



.sal 




Cloning of PCR Products of Different Sizes with the 
GATEWAY™ PCR Cloning System 



Size 


fmols PCR 


ng PCR 


Cols/ml 


i^orreci 




DNA 


DNA 


Transformation 


Clones/ 1 otal 








fr>TIC=l 0 8 CFU/ml> 


Examined** 


0.26 kb* 


15 


3 


1223 


10/10 (a) 




37.5 


7.5 


2815 




1.0 kb 


15 


10 


507 


49/50 (b) 




37.5 


25 


1 A A T 

1447 




1.4 kb 


15 


14 


27 1 


4o/D(J (C) 




37.5 


35 


683 




3.4 kb 


15 


34 


478 


9/10 (a) 




37.5 


85 


976 




4.6 kb 


15 


46 


190 


10/10 (a) 




37.5 


115 


195 




6.9 kb 


15 


69 


30 (235)** 


47/50 (b) 




37.5 


173 


54(463)** 





*The 0.26 kb PCR product was used unpurified; all the others were purified by precipitation with 
PEG/MgCl 2 as described in the text of Example 9, to remove primer dimers potentially present. 
Standard incubations were for 60 min. 



**overnight incubation 

(a) DNA minipreps 

(b) ampR/kanR 

(c) tetR/kanR 



Figure 77 





s. 



o 



Reading frame A: 



EcoRV 
1/2 site 



attR1 



attR2 



ATC 


ACA 


AGT 


Tlj3 


TAC 


AAA 


AAA 


TAG 


TGT 


TCA 


AAC 


ATG 


TTT 


| TTT 



- [Cnf -ccd* 



T T jC TTG TAG AAA GTG G 
A AAG AAC ATG | TTT CAC C 



EcoR V 
1/2 site 

AT 
TA 



Reading frame B: 
attR1 



attR2 



A TC AlAC A AGT TT k TAC AAA- AAA 
T AGXJLGT TCA AAC ATG TTT| TTT 



- [CnF 



_ ccd jT JTTjcjrTG_JTAC AAA GTG GTT GAT 
CC A AAG AAC ATG I TTT CAC CA ^. CTA 



Reading frame C: Cfcl^^^) 
attR1 



attR2 



AT CAA 



TA GTT 



ACA 


AGT 




TAC 


AAA 


AAA 


TGT 


TCA 


AAC 


ATG 


TTT 


TTT 



- [CnF -ccdJ 



T jc TTG TAC AAA GTG GXE_£GA T 
~AAG AAC ATG | TTT CAC C& ^QCT A 



U 5 

01 



Reading frame C: 

att Rl 



m K2 



AT C AA 
TA GTT 



ACA AGT TI p/WOT JU&/ 1 
TGT TCA AAC ATG TTT 



-CmR-ccdB] 



TTG TAC AAA GTG G' 
:</fl^/fy(yg\ TTT CAC CJ 



_TGA 1 
ACT I 



Fusion protein 

codon Reading frame A cassette 

i , 1 v 

/ s / \t<~ ,ra aat tta tac aaa aaa get 

.- nnn nnn ate aca agu 

- nnn nnn tag tgt tea aae atg tt t/ ttt ega 

\ — — r 

attR 1 

Reading frame B cassette 

' > 

r . ^ 0 _„ -> at tta tac aaa aaa get 

:: £E SS SS^SS SU «t t« =*. 



cannot be TG or TA 

Reading frame C cassette 



__- nnn nnn' nat eaa_aea agt ttg tac aaa aaa get 
__- nnn nnn nta gtt tgt tea aae atg ttt ttt ega 





Pvull 



.Notl 
■Xmalll 
l.Banl 



c 



Srfl Smal Alw21l Sdul 
Aval Mam' Alw44l 



Esp3l 
iPflMI 
ii .Dsal 

:| 'NCOl 

!! Styl 



attR1 



Kpn2l ii 

LApol ii 

sEcoRI ii 

j.BsrDI S 



Sspl 
.Hinfl 

rrfii 



Seal 
jMlul 

.Bst1107l AlwNJ. 
: Bbvl IBsmfl 



.Pstl 
iSfcl 
BstXI IBspMI 
.CfrlOiSall 
:Bsal iTaql 




attR2 



CmR 



(ccdA) 



ccdB 



ry 



1000 



1500 




rf A Cassette (1 711 bps) 



.Notl 

•Xmalll 

I.Banl 



Pvull 



Psp1406l 

Esp3l 



LPflMI 
Dsal 
Ncol 



.Kpnfel § 
lApol n 
{EcoRI s 
i.BsrDBsuI ~ 
II , II , J 



Styl 
Sspl 
; Hinfl 
Tfil 

Seal 
Bqlll 



TBssHII 
.Bst1107l .AlwNj 
: Bbvl -Bsmfl 
I.I ■ J .M 



attFM 



BstXI 
.Cfr10 
•Bsal 

II 



.PstI 
ISfcl 
BspMI 
tSall 
STaql 



T 



attR2 



CmR 



(ccdA) 



ccdB 



500 



1000 
J L 



1500 



rfB Cassette (1713 bps) 



pi CiU^ ii 





Dsal 
iNcol 
jStyl 
Sspl 
Hinfl 
rrfil 



.Notl 
iXmalll 
• .Hgal 

.Sau96l 
: .BsaAl 

UL1 r- 



Pvull 

.Kpn2l 
iApol 
5EC0RI 
i.BsrOI 

!• .Psp1406l 
\\ i.Gsul 



Seal 
.Xbal 
iMael 
iBamHI 
jBstYl 
1 .BssHll 

! • .Bst1107l AlwNI 
II I. I ■ I i 



BstXI 

ctnoi 

Pstl 
[Sfcl 
[BspMI 
Bsal LSall 
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rfC Cassette (issebps) 



1320 



C 



Srfl Smal BeaBI BslHKAl 
Aval Xmal Alw44l Bsp1286l 



Bpml,553 

BsmBI,670 
!Van91 1,678 
( Dsal,753 



Notl, 131 
;BstEI,132 
Eco52l,132 

Pvull r 35Q !! 
.Accl|l f 448;i 



i t Banl f 143 



iApo|452 
iEcoRI.454 
!.BsrDl f 464i ... 

!! .Pspl4i06i^dB 



|Ncol,753 
Styt,753 
Sspl ,762 
.1^11,802 

m. 1,802 . 

Scal,8$7 
,Xbal,8SB 
jBfal,$j0*> . . 
! .Bs3H31,946 [ 
1 * .Bistf 107^9*5 



1352 

Bbvl.1023 
!Tsel,1023 
" Smll,1066 

BbvCI,1176 
.AIwNl,1226 
>Bsmft,1228 

JBstXI,1433 



I I I I I I 



I Ifl Li 



I.Cfli 01,1 452 
Bsal.1455 

.Pstl.1574 
iSfcl f 1574 
[BspMI, 1576 
[Sall.1580 
lTaql t 1581 



attR1 



attR2 



CmR 



(ccdA) 



ccdB 



rfC cassette (1715 bps) 





prfC Parent III 4554 bp 

Location (Rase Nos . ) Gene Encoded 

410 . .286 attRl 

660 . .1319 CmR 

1439.. 1523 inactivated ccdA 

1661. .1966 ccdB 

2007 . .2131 attR2 

2753 . . 3613 a™P 

1 GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 
61 CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 
121 CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 
181 TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG CCAAGCTTGC 
241 ATGCCTGCAG GTCGACTCTA GAGGATCCCC GGGTACCGAT ATCAAACAAG TTTGTACAAA 
3 01 AAAGCTGAAC GAGAAACGTA AAATGATATA AATATCAATA TATTAAATTA GATTTTGCAT 

3 61 AAAAAACAGA CTACATAATA CTGTAAAACA CAACATATCC AGTCACTATG GCGGCCGCTA 
421 AGTTGGCAGC ATCACCCGAC GCACTTTGCG CCGAATAAAT ACCTGTGACG GAAGATCACT 

4 81 TCGCAGAATA AATAAATCCT GGTGTCCCTG TTGATACCGG GAAGCCCTGG GCCAACTTTT 
541 GG CGAAAATG AGACGTTGAT CGGCACGTAA GAGGTTCCAA CTTTCACCAT AATGAAATAA 

6 01 GATCACTACC GGGCGTATTT TTTGAGTTAT CGAGATTTTC AGGAGCTAAG GAAGCTAAAA 
661 TGGAGAAAAA AATCACTGGA TATACCACCG TTGATATATC CCAATGGCAT CGTAAAGAAC 
721 ATTTTGAGGC ATTTCAGTCA GTTGCTCAAT GTACCTATAA CCAGACCGTT CAGCTGGATA 

7 81 TTACGGCCTT TTTAAAG AC C GTAAAGAAAA ATAAGCACAA GTTTTATCCG GCCTTTATTC 
841 ACATTCTTGC CCGCCTGATG AATGCTCATC CGGAATTCCG TATGGCAATG AAAGACGGTG 
901 AGCTGGTGAT ATGGGATAGT GTTCACCCTT GTTACACCGT TTTCCATGAG CAAACTGAAA 
961 CGTTTTCATC GCTCTGGAGT GAATACCACG ACGATTTCCG GCAGTTTCTA CACATATATT 

1021 CGCAAGATGT GGCGTGTTAC GGTGAAAACC TGGCCTATTT CCCTAAAGGG TTTATTGAGA 
10 81 ATATGTTTTT CGTCTCAGCC AATCCCTGGG TGAGTTTCAC CAGTTTTGAT TTAAACGTGG 
1141 CCAATATGGA CAACTTCTTC GCCCCCGTTT TCACCATGGG CAAATATTAT ACGCAAGGCG 
1201 ACAAGGTGCT GATGCCGCTG GCGATTCAGG TTCATCATGC CGTCTGTGAT GGCTTCCATG 

12 61 TCGGCAGAAT GCTTAATGAA TTACAACAGT ACTGCGATGA GTGGCAGGGC GGGGCGTAAT 
1321 CTAGAGGATC CGGCTTACTA AAAGCCAGAT AACAGTATGC GTATTTGCGC GCTGATTTTT 

13 81 GCGGTATAAG AAT AT AT AC T GATATGTATA CCCGAAGTAT GTCAAAAAGA GGTGTGCTAT 
1441 GAAGCAGCGT ATTACAGTGA CAGTTGACAG CGACAGCTAT CAGTTGCTCA AGGCATATAT 
1501 GATGTCAATA TCTCCGGTCT GGTAAGCACA ACCATGCAGA ATGAAGCCCG TCGTCTGCGT 
1561 GCCGAACGCT GGAAAGCGGA AAATCAGGAA GGGATGGCTG AGGTCGCCCG GTTTATTGAA 
16 21 ATGAACGGCT CTTTTGCTGA CGAGAACAGG GACTGGTGAA ATGCAGTTTA AGGTTTACAC 
16 81 C T ATAAAAG A GAGAGCCGTT ATCGTCTGTT TGTGGATGTA CAGAGTGATA TTATTGACAC 
1741 GCCCGGGCGA CGGATGGTGA TCCCCCTGGC CAGTGCACGT CTGCTGTCAG ATAAAGTCTC 
18 01 CCGTGAACTT TACCCGGTGG TGCATATCGG GGATGAAAGC TGGCGCATGA TGACCACCGA 
18 61 TATGGCCAGT GTGCCGGTCT CCGTTATCGG GGAAGAAGTG GCTGATCTCA GCCACCGCGA 
1921 AAATGACATC AAAAACGCCA TTAACCTGAT GTTCTGGGGA ATATAAATGT CAGGCTCCGT 
1981 TAT AC AC AG C CAGTCTGCAG GTCGACCATA GTGACTGGAT ATGTTGTGTT TTACAGTATT 
2 041 ATGTAGTCTG TTTTTTATGC AAAATCTAAT TTAATATATT GATATTTATA TCATTTTACG 

' 2101 TTTCTCGTTC AGCTTTCTTG TACAAAGTGG TTCGATATCG GTACCGAGCT CGAATTCACT 
2161 GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT 
2 221 TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 
2 2 81 TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGCGCCTG ATGCGGTATT TTCTCCTTAC 
2 3 41 GCATCTGTGC GGTATTTCAC ACCGCATATG GTGCACTCTC AGTACAATCT GCTCTGATGC 
24 01 CGCATAGTTA AGCCAGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG 
2461 TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA 
2 521 GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGACGAAAG GGCCTCGTGA TACGCCTATT 
2 5 81 TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 
2 641 AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 
2 7 01 CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 
2 761 TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 





2941 
3001 
3061 



™r*nnn** APrrTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 

- ™ Ess ssss ss= ssss s=s 
iii sees = esse sees ssss 

3121 SEacc SSSESE acactgcggc caacttactt ctgacaacga tcggaggacc 
^ gaagSaScta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 
lllr ggISccggag" ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc 

3301 AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 
llll ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGOCCCT 
tccISctogc TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT 

Sttgcagca Sggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg 
gaScaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat 
taagSttgg" t~aactgtcag accaagttta ctcatatata ctttagattg atttaaaact 

TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 

cccttaacS gXgttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc 
tcSSmat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 
accagSSg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 
StcagSga gcgcLatac caaatactgt ccttctagtg tagccgtagt taggccacca 

CTTCAAGAAC TCTGTAGCAC CGC CT AC AT A CCTCGCTCTG CTAATCCTGT TACCAGTGGC 
Zc^CMT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC T C AAGACG AT AGTTACCGGA 

4 0BX ESSE cggtcgggct gaacgggggg ttcgtgcaca c-cccjGCT togagc-aac 
Inll trcctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga 
4141 rrr cacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 

ESSE ESSE cctggtatct ttatagtcct gtcgggtttc cccacctctg 
aStgaSS cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 
All JaacSScc Stttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc 
SEES cSgItStg tggataaccg tattaccgcc tttgagtgag ctgataccgc 
ESSE cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aaga 
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i I Af 111,1 869 
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2543.AIWNI.. 
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CMV promoter 




fis^ pit 




pDEST28 7141 bp 

GCCCAGTACATGACCTTATGGGACTTTCCTACTao^ GCGTGGATAGCGG TTTGAC 

CCTATCRGTGATAGRGATCGTCGACGAGCTL^111«^ CGGRCT 
CGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGG 

ATGTACCTATAACCAGACCGTTCAGCTGG^TATTACGGC 

CCTGGCCTATTTCCCTAAAGGGTTTATTGAGAAT^ 
GGTGAGTTTCACCAGTTTTGATTTAAACGTGGCCAAT^ 

TTTCACCATGGGCAAATATTATACGCAAGGCGAC^^ 
GGTTCATCATGCCGTCTGTGATGGCTTCCATG^TCGGCAGAATGC 

GTACTGCGATGAGTGGCAGGGCGGGGCGTAAAGATCTGG^ 
ATAACAGTATGCGTATTTGCGCGC^ 

IgcSa?^^^^ 

GGGACTGGTGAAATGCAGTTTAAGGTTTACACC i A i GG ATGGTGATCCCCCTG 
TTTGTGGATGTACACSAGTGATATTATT^^CGCCCGGG^A^ 
GCCAGTGCACGTCTGCTGTCAGATAAAGTCTCCCGTG^ 
GGGGATGAAAGCTGGCGCATGATGACCACCGA^ 

CTGGGAAAACTGCTAGCTTGGGATCTTTGTGAAGG^ACCT 
ATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGl^^^ 
ATAATGTGTTAAACTAGCTGCATATGCTTGCTGOT 
TTTATGAAAATATTATACACAGGAGCTAGTGAT^ 
CAGTCCCAAGGCTCATTTCAGGCCCCTCAGTCC^ 
CCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTC 

CCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTiG TTTTTCACTGC ATTC 

TT AC AAA'I'AAAGC AAT AG ^^^^^^^^^^p m CTTATCAT GT CT GG AT CG AT C CTG CAT T 
TAGTTGTGGTTTGTCCAAACTCATCA^ 
AATGAATCGGCCAACGCGCGGGGAGAGGC^ 
AGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGC 
CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGC^ 



;AA^fccTTGATTAGGGTGATGGTTCACGTA^^G 



JGGCCATCGC 



TTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATTTAACGCGA 

cSgagSaI?^ 



jcctatttttataggttaatgtcatgata 
?tcggggaaatgtgcgcggaacccctatt 



agacaagctgtgaccgtctccgggagctgc^ 
t G t T t^ttttctaaa^^ 



aaacgcgcgagacgaaagggcctcgtgatacgcctatttttataggttaa^ 
ataatggtttcttagacgtcaggtggcacttt^ 



atgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgccctt 

GCCTTCCTi 
TGGGTGCA 
TTCGCCCC— 

Kaaottctgctatgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggt 



-" TrrrTTTTTTGCGGCATTTTGC CTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAA 



CGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGA^AGCAT 

"\TTA 

jATC ~ 

CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCC 



ATACCAAACGACGAGCGTGACACCACGA' 



TGCCTGTAGCAATGGCAACAACGTTGCGCAAA 



=iiiF=^S 



S?S?^Ir?TArrAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCA 
^SS?Scc?CGCTCTGCTAATCCTGTT^^ 

5Sc5tIJcSSgttggac^ 

I??cggSS??cgtgca^^ 

??ScagcStca§cattgag^ 

rCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGG 

5Sg?™???2?aS?cSgtcgggtttcgc^ 
gI?aa^a?^^ 



CGCAGCGAGTCAGTGftGcjl^GC^ 

^A^GGTTCCGCGC^^ 



JCCTCTCCCC 




f1 intergenic region SV40 promoter 





pDEST29 7156 bp 

GGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGA 



taaagaccgtaaagaaaaataagcacaagttttatccggccttt^ 

GCCTGATGAA 



CGTGTTACGGTGAAAACCTGGCCTATTTCCCTAAAGGGTTTATTGAGAATATGTTTTTCG 

GGGC 

tgccgctggcgattcaggttcatcatgccgVotgtgatggcttccatgtcggcagaatgc 



ttaatgaattacaacagtactgcgatgagtggcagggcggggcgtaaacgcgtggatccg 
tacagtgacagttgacagcgacagctatcagtYgctcaaggca^^ 



'TCTGCTGACGAGAACAGGGACTGGTGAAATGCAGTTTAAGGTTT 

Satcgtgatccccctggccagtgcacgtctgctgtcagataaagtctcccgtgaacttta 



GAGCCGTTATCGTCTGTTTGTGGATGT 



rrrcpTGGTGCATATCGGGGATGAAAGCTGGCGCATGATGACCACCGATATGGCCAGTGT 
AAACGCCATTAACCTGATGTTCTGGGGAATATAAATGTCAGGCTCCGTTATACACA 



GCCGGTCTCCGTTATCG 




ca?Satca?aatcagccataccacatttgtaga^ 



ArArCTCCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTAT 
TPrarrTTATA^TGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGC 





GGCCTGAAATAACCTCTGA^GAGG^CTTGGTTAGGIA CCAGCAGGCAGAA 

AGCTGTGGAATGTGTGTCAGTTAGGGTGTC^^ 

GTATGCAAAGCATGCATCTC^TTAGTCAGC^ 

CAGCAGGCAGAAGTATGCAAAGCATGCATCTCAAllA^i TC CGCCCCATGGCT 

TAACTCCGCCCATCCCGCCCCTAACT^^ 

GACTAATTTTTTTTATTTATGCAGAGGCCGA^ 

AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTT^ 

CAACAGTCTCGAACTTAAGACCATGGCCAAGCCTTTC 

TGAAAGAGCAACGGCTACAATCAACAGCATCC^ 

CGCAGCTCTCTCTAGCGACGGCCGCATOT^ 

GGGACCTTGTGCAGAACTCGTGGTGCTGGGCACTGCl^ C CCTGCGGACGGTG 

GACTTGTATCGTCGCGATCGG^TGAG^ 

CCGACAGGTGCTTCTCGATCTGCATCCTGGGAT 

ACAGCCGACGGCAGTTGGGATTCGTGAATTGCTGCC^ 

AGCACTTCGTGGCCGAGTTCGAAATGACCGACC^ 

GGCCGCAATAAAATATCTTTATTTTCATTACATCTG^ TC TGATGCCGCATAG 
ATAGCGATAAGGATCCGCGTATGGTGCACTCT^ 
TTAAGCCAGCCCCGACACCCGCCAACACCCG^ 
CCGGCATCCGCTTACAGACAAGCTGTGACCG^ 

TCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCU CTTTTC GGGGAAATGTG 
GTTAATGTCATGATAATAATGGT^ 

CGCGGAACCCCTATTTGTTTATTmCTA^TACAT^C GTATG AGTATTCAACAT 
CAATAACCCTGATAAATGCTTCAATAATATTGAAAAA 
TTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCA^ 
GAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCA^^ 

GAACTGGATCTCAACAGCGGTAAGATCOT^ 

ATGATGAGCACTTTTAAAGTTCTGCTATGTGG 

CAAGAGCAACTCGGTCGCCGCATACACTATTCTCA^ 

GTCACAGAAAAGCATCTTACGGATGGCATGA^ 

ACCATGAGTGATAACACTGCGGCCAACTTACTTCT^ 

CTAACCGCTTTTTTGCACAACAT^ 




ACAACGTTCCULiWi^ x « ± - - - - - - ACCAC TTCTGCGCTCGGCCCT T«-^^ * 

cgtgagttttcgttccactgagcgtcaga^ 
gatcctttttttctgc^ 

gtggtttgtttgccggatcaagagctaccaactc^ TAGTTAGGCC accacttcaag 
agagCgcagataccaaatactgtccttctagtgtagc^ TACC agtggctgctgcc 



AGTGGCGATAAGTCGTGTCTTACCGGGTlG^^i GCTT GGAGCGAACGACCTAC 
CAGCGGTCGGGCTGAACGGGGGGTTCGTGCACA^ 

^CGAACTGAG^^ 




CGTCGATT^T^^ 





AGCCGAACGACCGAGCGCAgTaGTCA^^^ 

AAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAftl QRTRCRTRTT TGAA 
TTTCAATATTATTGAAGCATTTATCAGGGTTATTGTC GAAAAGTGCCACCT 



GA 1 

CCCTTTCACTCATTAG 



O 
Q 



CMV promoter 




f1 intergenic region 




pDEST3 0 7 544 bp 

ATGCATGTCGTTACATfACTTACGGTAAATGGCCCGCCTGGCTGACCGCCC 

GGCGAGATTTTCAGGAGCTAAGG^GCT^TGGAGAAAAA^lCA TTGCTCA 

CCTGGCCTATTTCCCTAAAGGGTTTATTGAGJ^ 
GGTGAGTTTCACCAGTTTTGATTTAAAC^GTGGCCAATATGGACAA 

GCTTCCATGTCGGCAGAATGCTTAATGAATTACAACA 
.GTGGCAGGGCGGGGCGTAAAGATCTGGATCCGGCTTACTAAAAGCCAG 



^AAACGTGGCCAATATGGACAACTTCTTCGCCCCCGT 
TTTCACCATGGGCAAATATTAT^ 

:gtgccgaacgctggaaagcggaaaatcagg 



TACCCGAAGTATGTCAAAAAGAGGTGTG 
AGCGACAGCTATCAGTTGCTCAAGGCATATAT^^ 
CAACCATGCAGAATGAAGCCCGTCGTCTGCGTCCCG^^ 



AAGGGATGGCTGAGGTCGCCCGGTTTATTGAAATUA 
GGGACTGGTGAAATGCAGTTTAAGGTTTACACCTATAAAA^ 
TTTGTGGATGTACAGAGTGATATTATTGACACGCCCGGGCGAC^t^Ai^u 

GCCAGTGCACGTCTGCTGTCAGATAAAGTCTCC 
GGGGATGAAAGCTGGCGCATGATGACCACCGATA^GGCCAGTG^ 

CTTATACACAGCCAGTCTGCAGGTCGACCA 
.TTATGTAGTCTGTTTTTTATGCAAAATCTA 



GGGGAAGAAGTGGCT^^ 

ATGTTCTGGGGAATATAAATGTCAGGCTCCCTTATACACAGC 
ATAATGTGTTAAACTAGCTGCATATGCTTGCTGCTT^ 



TTTATGAAAATATTATACACAGGAGCTAG^TGATTCTAA 



TAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATC 
CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGT^ 




ATCTCAATTAGTCAGCAACCAGGTGTGG^GTCCCCAGGCT 

rprprp t\ Tnps & ^^^^^^^^^^^^^^^^^^qq^^q^^^CTTCTGAC A^AAC AGT CT CGAACT 

A^CAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTG 



?aILctagagccaccatgattgaacaagatg^^^ 



OGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGG™^ 



GGACCX 



CGAAGTGCCGGGGCAGGATCTCCTGTCATCT^ 
CATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATC^CG 

CTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGG^A^i^ TC ATTACA 
CTCCGGGAGCTGCATGTGTC^GAGGTTTO^CCGT^l^^ 

GG gcctcgtgatacgcctat™taggtt^ 

jAaa c ^ CCCTGATAAATGC ttc aataatattg 

a^ggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcggc 

attttgccttcctgtttttgctcaccc ' 
tcagttgggtgcacgagtgggttaca': 

GAGTTTTCGCCCCGAAGAACGTTTT^^ 



gtcaggtggcacttttcggggaaatgigc^^^^ TGATAAAT gcttcaataatattg 
acattcaaatatgtatccgctcatgagaca™ 
aaaaaggaagagtatgagtatti 
attttgccttcctgtttttgcti 

TCAGTTGGGTGCACGAGTGGGT' 
GAGTTTTCGCCCCGAAGAACGT' 

'IgcaItcgcaacaacgttgcgcaaactattaactggcgaact 
5caaca1ttaatagactggatggaggcggataaagttgcagg 
^cttccggctggctggtttattgctgataaatctggagccgg 

"'TATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTAT 

-??5gagtcaggcaactatggatgaacgaaatagacagatcgc 
^gS??a1Sca??ggtaactgtcagaccaagtttactcatatat 
Sot?catttttaatttaaaaggatctaggtgaagatcctttt 
Sa?ccc??aacgtgagttttcgttccactgagcgtcagaccc 
gatcttcttgagatcctttttttctgcgcgtaatctgctgctt; 
cgctaccagcggtggtttgtttgccggatcaagagctaccaac 



tgtaactcgccttgatcgttgggw^^------- cg taactggcgaact 

gcactggggccagatggtaagccctcccgtat 
gcaactatggatgaacgaaatagacagatcgc 

'TGGTAACTGTCAGACCAAGTTTACTCATATAT 
'TAATTTAAAAGGATCTAGGTGAAGATCCTTTT 
lCGTGAGTTTTCGTTCCACTGAGCGTCAGACCC 



[TCAGGCAACTATGGATGAACGAAATAGACAGATCGC 
lGCATTGGTAACTGTCAGACCAAGTTTACTCATATAT 





TTTTGCTCACATGTTCTTTCCTGCGilAi^^ CGACCG AGCGCAGCGAGTCAGTGAG 
CTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCG^ 



/ . 



• 



CMV promoter 




f1 intergenic region 





pDEST31 7 559 bp 

ATGCATGTCGTTACATAACTTACGGTAAATGGCCCGCCTGG 

CGCCCATTGACGTCAATAATGACGTATGTTCCCA 

TGACGTCAATGGGTGGAGTATTT^ 




AATCAACGGGACTTTCCAAAATGTCGTAACAAC^TCCGCCC 

CG^CATCCACGCTGTTTTGACCTC^^ 
ATGGCGTACTACCATCACCATCACCATCA^ 
TTGTACAAAAAAGCTGAACGAGAAACGTAAAAT^3^ 
ATTTTGCATAAAAAACAGACTACATAATACTGT 

C GG C C GC AT T AGGC AC C C ^ ^^^^^^^S^S q^qq^ AAGGAAGCT AAAATGG AG AAAAAAA 
TTTTGAGTTAGGATCCGGCGAGATTTTCAGGAG 

TCACTGGATATACCACCGTTGATATATCCCAATGGC^^ 



ACTTCTTCGCCCCCGTTTTCACCAl^^^^ GGCTTC CATGTCGGCAGAATGC 
TGCCGCTGGCGATTCAGGTTCATCAT^GCCGTCTGTG 

3ATAACAGTATGCGTATTTGC 
^TACCCGAAGTATGTCAAAAA 
-AGCGACAGCTATCAGTTGCT 
TCCGGTCTGGTAAGCACAACCATGCAGAATGAAGCC^ 
AAAGCGGAAAATCAGGAAGGGATGGCT 



TTAATGAATTACAACAGTACTCC^Ai^ 
GCTTACTAAAAGCCAGATAACAGTA^ 

TATATACTGATATGTATACCCGAAGTATGTCAAAAA^ GGCATATA TGATGTCAATATC 
TACAGTGACAGTTGACAGCGACAGCTA^ 



TTTGCTGACGAGA^CAGGGACTGGTGAAATG 

gagccgttatcgtctgtttgtggat^ 

GATGGTGATCCCCCTGGCCAGTGCACGTCTGCT^^ 

CCCGGTGGTGCATATCGGGGATGAAAGC^ 

GCCGGTCTCCGTTATCGGGGAAGAAGTGGCTC 

AAACGCCATTAACCTGATGTTCTGGGGAATAT^ 

GTCTGCAGGTCGACCATAGTGACTGGATA^ 

ttttatgcaaaatctaatttaatatattgatat™ 

CTTTCTTGTACAAAGTGGTGATGGGCGGCCG^ 
GCGACGTCATAGCTCTCTCCCTATAGT^GA 




a^ttt™^^^ 

G CT TACT G AGT ATG ATT ^^^^^^^^^^^ G A ^^^ C AGGC C C CT C AGT C C T C AC AGT CT GT T 





c.?££?g^a??^^^^ 

PC AGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGG 

TGCCTGCTTGC^G^ATATCATGGTGG^AAATG^CCGCTTTTCT 

CCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGA 



CCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTG 

TTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGG 



AGAGCTTGGCGGCGAATGGGCTGA 



T-rrrAAATGACCGACCAAGCGACGCCCAACCTGCCATCACGATGGCCGCAATAAAATATC 
PPTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCC 



a r A AP PTrTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAA 
ACAAGCTGTGACCGTCTCC^A^ atacgcctatttttataggtt ^ tgtcatgataat 

"aa-?ggtttcttagacgtcaggtggcacttttcggggaaatgtgcgcggaacccctatttg 



acgcgcgagacgaaagggcct 



■T^TATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAAT 

TCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGA 

r^T^AAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTT 
Sr?^GC?A?OTGGCGCGOT^^ 

?ScgSItgSStSca^ 

a ppa^ A CP Arc AGCGTGACACCAC 

S?5a^tcgcga1c?actta^ 

^I^A^OT^SctGGACCACTTCTGCGCTCGGCCCTTCCGGCTGG 
TAAATC^GGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCA 

IaISag^ca^^ 
a^tactc^t^ 

^^^3|™TTTTGATAATCTCATGACCAAAATCCCTT^ 

^^^^^^P^P^q^qqq^q^^q^^^AGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCG 
TrAA^AGCTACCAACTCTTTTTCC 

?Sctg?cc?tc?a^tgtagccgtagttaggccaccacttc^ 
?aS?IcScgctc?g?taa^ 

tcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctga 




GGGGGGTTCGTGCACACAg|TcA^ 
ACAGCGTGAGCATTGAGAAAGCGCCACCOT^ 

CTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA 
GGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGT 

TAACCGTATTACCGCCTTTGAGTGAGCTGATACCG 
CAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGC^CCCAATACG 

GCGTTGGCCGATTCATTAATGCAGAGCTTGCAATTC^^ 




pDEST32 12288 bp 

MTTTTTCTTTAATTTCTTTTTTTRCTTTCTATTTT^Tl GGCAC ttttcgg 
ATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAGOAC ^^^^^^^^^^^^^ 

ACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATC^ CCTTGC GCTCGT 

gcagcaacgatgttacgcagcagcaacga^ 

AAGTTAGGTGGCTCAAGTATGGGCATCATTCGCA 

aaatccatgcgggctgctcttgatcttttcgg^ 

TCCCAACATCAGCCGGACTCCGATTACC^TCGGGAACTTG 

GCGCTTGCTGCCTTCGACCAAGAAGCGGTTGTTGG^i ccggcgagcac 

AGGTTTGAGCAGCCGCGTAGTGAGATCTATATCTATGATCT^^ 

CGGAGGCAGGGCATTGCCACCGCGCTCATCAATCTCCTCAAGC^ 

GGTGCTTATGTGATCTACGTGCAAGCAGATTACGGT^^ 
ACAAAGTTGGGCATACGGGAAGAAGTGATGCACTTTGA^ 

TGTAACACTGGCAGAGCATTACGCTGACTTGA 

AGAACTCTGTAGCACCGCCTACATACCTCGCTCT^ 
CCAGTGGCGATAAGTCGTGTCTTACCGGGT^ 

CGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACAC^ TC CCGAAGGGA 
AGAGCGAACTGAGATACCTACAGCGTGAGCATT^ 

TTCCAGGGGGGAACGCCTGGTATCTTTATAGTC^ 
AGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATG 

CGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTG 

TATCCCCTGATTCTGTGGATAACCGTATTACCGL.L.11 CGGAA GAGCGCCCAATAC 
GCAGCCGAACGACCGAGCGCAGCGAGTCAGTG^ 





ACTAAAGGGAACAAAAGCTGGTACCGATCCCGAGCTTTGCAAATTA 
CCCAAAACCTTCTCAAGCAAGGTTTTCAGTATAATGTTACATGCGT 

TCCTTCTTCAACCCACCAAAGGCCATOTGG^ 

•TTGTGGAAGTATCTCATACCAACCTTA 
JVTTCTGTGGTGATGTTGACCACCGGCC 
.CCGATACGACCTTTACCGGCTGAGACG 
5GAAGGCATTCTTGATTAGTTGGATGAT 
^AGGAAAAT C AACGGAGAAAGCAAACGC 
[■TTGAACCTATCTGGAAAATAGCATTAf 
3TCTCTGCGGGCTATTCACGCGCCAGAC 
\TAATTTTGATTTTGGTAATGTGTGGGT 
"TCTTGTTTTTGCTGTGTTTTTCGATGi 
rCACCGATGCTAAGTTATCTCTATGTAi 
AAGAGATACAGGATTGGCAACTGCAAA r 
TCGAAGAAATGATGGTAAATGAAATAG< 

zatacaacactggaaatggttgtc" 
rttattatgttacaatatggaagg 

\AGTCCAATGCTAGTAGAGAAGGG 
CTAAACCGTGGAATATTTCGGAT2 
CTCTTTTCTGGCAACCAAACCCA1 
TAACATGTAGGTGGCGGAGGGGAG 

ggctaaacaagactacaccaatt; 

ACTGTAGCCCTAGACTTGATAGCC 



TGGGAT C AATGT C CACaai , -™^cq^ 

:a 

ST 
CT 

:c 
rc 

AC 
Tl 

TCG^GAAATGA^ 



CTTGTGGAAGTATCTCATACCAACCTTAC^ 

'GA 
'GT 

:at 
;a? 

VTC 

AAG AG AT AC AGGATT GG ^ AAC TG C AAAT AGi-* q^^qq a qq^tGAAGG CAAAAG AC AAAT A 



CCATCTTAAATATACGGGATACAGATGAAAGGG 
AACAAGGGAAAAACTGCGAGGAAAATTGTTTGC 
.GGAAAATAGGAAAAATAACAGGGCATTAGAAAA 
^JTCCTGGTGTACAGATGTTACATTGGTTACAGTA 

CTCTTGTTTTTGCTGTGTTTTTCGATGAATCTCCAAAATG^TT 



TTTATTATGTTACAATATGGAAGGGAAC^TTACACTT 



AAGCATGCGATATTTGCCGACTTA^AAAGCTCAAGTGCTCCAAA 



.TGATTTTGAAAATGGATTCTTTACAGGATA 





ATATCCAGTCACTATGGCGGCCGCT^GTTOGCAGCATCACCC^ TGTCCCTGTTGA 

TAT AT C C ^^^^^^^^^^^^^^^p^^? m^^QQQC CTTTTTAAAGAC CGT AAAG AAAAATAA 
CTATAACCAGACCGTTCAGCTGGATATTACGGCCllli^ 

GCACAAGTTTTATCCGGCCTTTATTCACATTCTTGC^^ 




CATGGGC^TATTATACGCAAGGCGACAAGGTOT 

GATGTACAGAGTGATATTATTGACACGC^ 

GCACGTCTGCTGTCAGATAAAGTCTCCCGI^AA^iiJ-^ rGGTCTC CGTTATCGGGGAA 



TGGGGAATATAAATGTCAGGCTCCCTTATAUA^A^^ 
CTGGATATGTTGTGTTTTACAGTATTAT^ 

T^attgat^^ 



ATOGCCGCTAAGTAAGTAAGACGTCGAGCT™^ 
AGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTCTC^ 
TACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGT^ 
TGACACTTCTAAATAAGCGAATTTCTJTATGATT 

AAAAAAAAT AAGT GT AT AC AAATT T ^^^^^^^^S p^!^!^ q^qq>p AT AG CAT GAGGT CG C 
GTTCTTGAGTAACTCTTTCCTGTACMT^GGTTGCTTTCIOA^^^ 

TCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCC 
CACCCAATTGTAGATATGCTAACTCCAGCAATGA^ 

TGTC CT C AGAGG AC AAT AC ^^^^^^^ _^pS, pq^^^^^q^^^qt CGTGACTGGGAAAAC C C 
TAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTAC^AAC 

TGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTT 

CGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTT^GCG 

GCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGT 

ACACTTGCCAGCGCCCTAGCGCCCGCTCCTTT^ 

GGGATTTTGCCGATTTCGGCCTATTGGTTAAAA^ 

tataggataattatactctatttctcaacaagtaattgg™ 




r AATCGTCTTACTTTCTAACTTTTCTTACCTTTTACATTTCAGCAATATATATATATATT 
rrCAGGTGACCACGTTGGTCAAGAAATCACAGCCGAAGCCATTAAG 

??SgS?ct5cS5tccaatgtc^ 

TATCGATGCTACAGGTGTCCCACTTCCAGATGAGGCGCTGGAAGCCTCCAAGAAGGTTGA 

?SccS???S™ggtgctgtg 

^gcSSgIS^ttttagacttatctccw 
^T^^5^^otcagagaattagtgggaggtatttactttggtaag 

caSg^aI^tacattgaaggttcaacat^ 
cttggcctctttgccagacaagaacaccgcatttggtttgtacgaaccatgccacgg 

^^^^^^CA^GAtTAAGGTTGACCCTATCGCCACTATCTTGTCTGCTGCAAT 
r ATCTTGA^ATTGTCATTGAACTTGCCTGAAGAAGGTAAGGCCATTGAAGATGCAGTTAA 

a1£cS?gI?g^ 
???S?ga?a??5otacataaac 

tca^c^tgata^gg^aaaagaattgcactttaacattaatattgacaaggaggagggcac 
ca^caa^ot?a^^ 

^i?GGjOTTTAAGTCAATA^CTTGAA CCATTTCCCATAATGGTGAAAGTTCCCTC 

aXga^ttttactc^gtcagaaacggccttacgacgtagtcgatatggtgcactctcagta 
^^J™^Xtgccgcatagttaagccagccccgacacccgccaacacccgctgacg 

ggagctgcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcga 



F1 intergenic region 






pDEST33 8815 bp 



AGTCTTTTACACCATTTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTA 
ATCTAAGCGCATCACCAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTA^ 
TTTCG^GGCTCTCTTGCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCAC 
CTGTCCCACCTGCTTCTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGC 

SctcaStaotatgttgcagtcttttgg^ 

GGAACTCTTGGTATTCTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGT 

a2^a?tcacc^ 
a^tt^a^t^ 

cg^StctIg^^ 

gaccagaactacctgtgaaattaataacagacatactccaagctgcctttgtgtgctta^ 

?cacS?a^actcacgtgctcaatagtcaccaatgccctccctc 

t?ttttcg^ccg^ 

acgta^ggtgacaagctatttttcaataaagaatatcttccactactgccatc 
a5^gca^g^cacatatattacgatgctgtctattaaatgct 

t^cta^tct^^^ 

gccagcc^cgacacccgccaacacccgctgacgcgccctgacgggcttgtctgctcccgg 
ca?c?gcSacSgacaagctgtg 

cot?Sc™ga^cgcgcgagacgaaagggcctcgtgata^ 
a?ctcItca?aa^ 

III?aIacaaaggtttaaaaaatttcaacaaaaagcgtactttacatatatatttat^ 

A^AAGAAAAGC AG AT T AAAT AG AT AT AC AT T CG ATT AACG AT AAGT AAAATGT AAAATC A 
CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 

Sagagcaggaagagcaagataaaaggtagtatttgttggcgatccccctag 

?a??tt5ggaaaacaaaaIcta^ttttctttaatttctttttttactt 

tttatatatt^ata^taa^a^atttaaattataattatttttatagc 

gSJccaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttatttt 

a^StcaaStatgtatccgctcatgagacaataaccctgataaatgcttca^^ 

tgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcg 

^J^gccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgctg 

gatcagttgggtgcacgagtgggttac^ 

pap acttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgt 
tc?cSa^tcIc?tggttgagtactca 

IcIgtaagagaattatgcagtgctgccataaccatgagtgataacactgc 
cttctg^caacgatcggaggaccgaaggagctaaccgctttttttcacaacatgggggat 

catgta^ct^gccttgatcgttgggaaccggagctgaatga 
cotgIcIccacgatgcctgtagcaatggcaacaacgttgcgca^^ 

c?I2t?actctagcttc^ 
gSccScttS^^ 

ggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccctcccgt 
atcIt^g??^^ 

SctSSataggtgcctcactgattaagcattg 
a?IctttISattgatttaaaacttcattt 

tttgataKtctcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagac 

cccgtagAaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgc 

ttgcaaacaa^aaaaccaccgctaccagcggtggtttgtttgccggatcaagagctacca 

actctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgtccttc 

gtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgct 

ctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttg 

gIctcaagacgatagttaccggataaggcgcagcggtcgggctgaacggggggttcgtgc 

acScSg^ccagcttggagcgaacgacctacaccga^ 





TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 

CCTGTCGGGTTTCGCCACCTCTGA^ 
CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCm^ 

^^^^^^^^^^^^^ 

agSaagaatgccggttggg^ 



CAAGACATAATGGGCTAAACAAGACTACT 
GTACATAACGAACTAATACTGTAGCCCTAG^CT^^^^--^^^^^^^^^^^^^^^^^,^,^^, 



a?gI?g?aagacactaaaggaaaaaa™^ 

TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTT 
CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAG^TAC^^^ ^ 
TAAAAAAAGTTTGCCGCTTTGCTATCAAGT^TAAATAGACCTGCAATTA 



TTTC^CTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTC 

. . ^^TifTir-^a^Tr^f^ziTGATGAAGATACCCCACCAAACCCAAAAAAA^^^^^^^^^^^^^^ 



ttaaattagattttgcataaaaaa 
c^gSc^^ 

GAGCTAAGGAAGCTAAAATGGAGAAAAAAATCACTG^ 

CTAAAGGGTTTATTGAGAATATGTTTTTCOl^i^ TC ACCATGGGCA 




GGCAGGGCGGGGCGTAATCTAGAGGATCCGGOTACTf^GCCA^ GTRTGT 

S€=i==iii 

GCTGTCAGATAAAGTCTCCCGTG^CTTTACCC^l^ GGGGAAGAAGTGGC 
GCGCATGATGACCACCGATATGGCCAGTGTGCCGGTClC^i^i 

mmwtmmm 



CTCTTATTGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAA 

TAACTCCAGCA 

KaSg^aaacgttaatatttS^ 



TT TT AAC C AAT AGG C C G AAAT CGG C AAAAT C C C TT AT AAAT C AAAAG AAT AG AC ^^^^^^ 

3Gi 
AT< 

GC< 

GAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTG?TAGCGGTCACGCTGCG 



AGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAA. 

ccga^ttagagcttgacggggaaagccgg"o3aacgtggcgagaa 



CGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTA 
ATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGC 



JCACTAAATCGGAACCCTAAAGGGAGCCC 
iACGTGGCGAGAAAGGAAGGGAAG; 
iTAGCGGTCACGCTGCGCGTAACC; 
ACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCCCATTCGCCATTCACTGCA 




pDEST34 



7114 bp 




Location (ttaflP. Nos.) 



Gene 



195 . . 

304 . 

1305 

1651 

1780 

2675 

3334 

4343 



71 

963 

. 1610 
. . 1775 
. . 2472 
. . 2720 
. .4194 
. .4982 



Encoded 

attRl 

CmR 

ccdB 

attR2 

GST 

T7stop 

ampR 

ori 



C CT CT AG AT C AC AAGT TT GT ACAAAAAAGCTG AACG AGAAAC^ 
CAATATATTAAATTAGATTTTGCATAAAAAAC^ 

GGTGAGCTGGTGATATGGGATAGTGTTCACCOT^ 

GGCGACAAGGTGCTGATGCCGCTGGCGATTCAGGT^TCA 

CATGTCGGCAGAATGCTTAATGAATTACA^ 

TAAACGCGTGGATCCGGCTTACTAA^GCCA^ 

TTTTGCGGTATAAGAATATATACTGATATGTAIAL.^^ 

CTATGAAGCAGCGTATTACAGTGACAGTTGACA^ 

ATATGATGTCAATATCTCCGGTCTGGTAAGCA^ 

GCGTGCCGAACGCTGGAAAGCGGAAAATC^ 

TGAAATGAACGGCTCTTTTGCTGA^ 

ACACCTATAAAAGAGAGAGCCGTTATCGTClUlli TCTGCTGTCAGATA AAG 
ACACGCCCGGGCGACGGATGGTGATCCCCCTGGCCACT^ 

T CT C C CGTGAACTTT AC CCGGTGGTGC ATATCGGGGA^GAA CAGCCAC C 

ccgatatggccagtgtgccggtctccgt™ 

GCGAAAATGACATC^AAAACGCCATTAACO^ 

cccttatacacagccagtctgcaggtcgaccat^^ 

tattatgtagtctgttttttatg^ 

tacgtttctcgttcagctttcttgtac^ 

tggaaaattaagggccttgtgcaacccactcgact^tc^ 

tatgaagagcatttgtatgagcgcgatg^ 
ttgggtttggagtttcccaatcttccttatta^^^ 

tctatggccatcatacgttatatagc^ 
gagcgtgcagagatttcaatgcttgaaggagcgg^™^ 

agaattgcatatagtaaagac™ 
cccgataagggagcaggccagtaaa^gcattacccgtggt^ 





ACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCCGGATATCCACAGGACGG 
GTGTGGTCGCCATGATCGCGTAGTCGATAGTGGCTCCAAGTAGCGAAGCGAGCAGGACTG 
GGCGGCGGCCAAAGCGGTCGGACAGTGCTCCGAGAACGGGTGCGCATAGAAATTGCATCA 

GAGA^TTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATG 

Sa^aItcg?^ttagacgtcaggtggcacttttcggggaaatgtgcg 

ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGA 

^™^™AATAATA^GAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGC 

CTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACC 

AACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACT 
TTTA^AGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTC 

Sotc^cgcatacactattctcagaatgacttggttgagtactcaccagtcacagaaaag 

CATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTG 

a^?ac?gcggccaacttacttctgacaacgatcggaggaccgaaggagc 

^gcSaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctg^ 

S?ca?accaaacgacgagcgtgacaccacgatgcctgcagcaatggcaacaacgttgcgc 

a^c?a?5aactggcgaactacttactctagcttcccggca^^ 
^gcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttatt 

gc?ga?SI^gSgccgg^ 

gatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactatggat 
-•ga^cgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaactgtca 
ga^cSg^^tactcatatatactttagattgatttaaaact 

atctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcg 

SgcSc5?a15??gc?gcttgc^ 
c?gg2?2a^agctaccaactctttttccgaagg^^^ 
cca^?actgt?cttctagtgtagccgtagttaggccaccacttcaag^ 
ccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataag 

tcgtgtctta^cgggttggactcaagacgatagttaccggata 
?gS?Sgggggttcgtgcacacagcccagcttggagcgaacgacctacaccg^ 
tacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaaggcggacagg 
?atccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaac 

^^GOTATCTTTA^GTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTG 

?gSgc?cg^ 
??c??ggottttgctggcct 

gtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgacc 
cIgcgcagcgagtcagtgagcgaggaagcggaagagcgcctgatgcggtattttctcctt 

Sgca5c?g?SSgta?ttcacaccg 

5?gccgca?agttaagccagtatacactccgctatcgctacgtgactgggtcatggctgc 
Sc?ccgac^?SccaKc^ccgctgacgc 

?gc??acaSaca!gctctgaccgtctccgggagctgcatgtgtcagaggt 
a?ca?cga^cg^gcgaggcagctgcggtaaagctcatcagcgtggtcgtgaagcg 

aS5a5g^gcc?gttcatccgcgtccagctcgttgagtttctccag^ 
ctogcttctgataaagcgggccatgttaagggcggttttttcctgtttggtcactgatgc 

SccgtotaIS^^ 

gctca?gI^?gggttactgatgatg^^ 

2cSc5gS?ggta?ggatgcggcgggaccagagaa^ 

cttcgttaStacagatgtaggtgttccacagggtagccagcagcatcctgcgatgcagat 
ccggaI™a^tggtgcagggcgctgacttcc 

accgaagaccattcatgttgttgctcaggtcgcagacgttttgcagcagcagtcgcttca 
^^^^cgtatcggtgattcattctgctaacca 

c?ggg?cctcaacgacaggagcacgatcatgcgcacccgtggccaggacccaacgctgcc 

cgagSgScc^cgtgcggc^ 
g?tgS??5g?g?attcacagttctcc^ 



2^G^CGGCTTCCATTCAGGTCGAGGTGGCC^GG 



GAATCCGTTAGCGAGGTGCr " " — JGCTCCATGCA 



CCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAAC 

^ATAAATCGC 

^CGATCCTTG 

ACCTGCCTGGACAGCATGGCCTGCAACGCGGGCATCCCGATGCCGCCGGAAGCGAGAAGA 



CCGTTCCATGTGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAG 
GAAGTTAGGCTGGTAAGAGCCGCGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCT 



cc1S????cc?^^ 

GGTGATGCCGGCCACGATGCGTCCGGCGTAGAGG 




S 

Cl 

w 

ni 
o 
o 





pDONR2 07 5584 bp 

AACTGCCAGGCATCAAACTAAGCAGAAGGCCATCCTG^ 
ACAAACTCTTCCTGGCTAGCGGTJ 
AAGAACATGTGAGO^^ 



] AAACTCTT C CTGGCT AG CGGT AAT ACGGTT AT C C ACAGAAT C AGGGGAT AACG CAGGA 
^GAACATGTGAGCAAAAGGCCAGC " 
- GTTT TTCCATAGGCTCCGCCCCC 
AGGTGGCGAAACCCGACAGGACTA^™^^ 



acaaactct^^^^-^ 

GCGTTTTTCCATAGOT^ 



GGTAACT AT CGTCTTGAGTC CAAC C CGG r 
ACTGGTAACAGGATTAGCAGAGCGAGGTi 
TGGCCTAACTACGGCTACACTAGAAGGA' 



ACTGGTAACAGGATTAGC 



aEtStaacacga^agcagagcg^g^^^ 



AATTAACCAATTCTGATTAGAAAAACTCATCGAGCATCAAA 



i==liii=s 



ggcagtcgccctaaaacaaagtt^ggtggctcaagta 



?TACGCAGCAGCAACGATGTTACGCAGCAG 



3CAGTCTCCGGCGAGCACCGGAGGCAGGGC2 
CATGAGGCCAACGCGCTTGGTGCTTATGTG2 
CCCGCAGTGGCTCTCTATACAAAGTTGGGCJ 
GACCCAAGTACCGCCACCTAACAATTCGTTC 
CCCCTCGTCAAAAATAAGGTTATCAAGTGAC 
TGAGAATGGCAAAAGTTTATGCATTTCTTTC 
CTCGTCATCAAAATCACTCGCATCAACCAAi 
GAGACGAAATACGCGATCGCTGTTAAAAGGi 
GCGCAGGAACACTGCCAGCGCATCAACAATj 

iiiSsisijiiisssJ 



SS=S5S==S 

CCCGCAGTGGCTCTCTATACAAAGI 

gacccaagtaccgccacctaacaat 
cccctcgtcaaaaataaggttatc; 

TGAGAATGGCAAAAGTTTATGCAT'] 
CTCGTCATCAAAATCACTCGCATCi 



CATGAGGCCAA^^r, ^ ™ TTGGGCATACGGGAAGAAGTGATGCACTTTGATATC 

^CAATTCGTTCA? 
TATCAAGTGAGA2 
jCATTTCTTTCCZ 

CTCG^CATCAAAATCACTCGCATCAAC^AAACC^^^^^^^^^^^^^^^^^^^^^^q 



CCCGCAGTGGCTCTCTATACAAAGTTGGG^ATA^-^^^^^^^^^^^^^^^^^^^^^^ 

jvggttatcaagt 1 
toagaa?g5caaaagtttat^^ 



TTCCCGTTGAATATGGCTCATAACACCCCCTGTATTACTGTTTATGTAAGCAGACAGTTT 

TA r . 
CA1 
.TG< 

A7\ * 

AGACTACATAATACTGTAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATG. 



TATTGTTCATGATGATATATTTTTATCTTGTGCAATGTAACATCAGAGATTTTGAGACAC 

lT GG C AAAT AAT ( 

AACGAGAAACGTAAAATGATAT^A^ 





TTCAGGAGCTAAGGAAGCTAAAATGGAC 



ATCCCAATGGCATCGTAAAGAACATTTTGAGGCATTTCAGTCAGTTGCTCAAT 

CACCAGTTTTGATTTAAACGTC 
GGG C AAAT AT T AT ACG C AAGG C 

TATGTCAAAAAGAGGTGTGCTATGAAGCAGCGTATTACAGTGACAGTTGAC 

TATCAGTTGCTCAAGGCATATATGATGTC 
&GAATGAAGCCCGTCGTCTGCGTGCCGAJ 
CTGAGGTCGCCCGGTTTATTGAAATGAA< 
GAAATGCAGTTTAAGGTTTACACCTATAi 

GT AC AG AGTG AT AT T ATTGAC ACG C C CGI 
CGTCTGCTGTCAGATAAAGTCTCCCGTG 
AGCTGGCGCATGATGACCACCGA^GG_ 



CCGGCAGTTTCTACACATATATTCGCAAGATGTGGCGTGTTACGGTGAAAACCTGGCCT 

TTTCCCTAAAGGGTTTATT 
CACCAGTTTTGATTTAAAC 
GGGCAAATATTATACGCAA 
TGCCGTCTGTGATGGCTTC 

TGAGTGGCAGGGCGGGGCGi*^^^^^^-^^-^^-^ ATA ^^^ ATA ^Q,p^^QQQQ AA Q 
TATGTCAAAAAGAGGTGTGCTATGAAGCAG^GTATTAC 

JTGCCG2 
^AATGAZ 

^cctat; 

\CGCCCC 

CGTCTGCTGTCAGATAAAGTCTCCCG^ 



TATCAGTTGCTCAAGGCA^ 

:tgaaatga7 

["ACACCTATi 

GTACAGAGTCATATTATTGAC 



agaatgaagcccgtcgtctg^^x-^^^^------ acgagaacagggactggt 

CTGAGGTCGCCCGGTTTATTGAAATGAACGGCTCTTTT^ 
GAAATGCAGTTTAAGGTTTACACCTATAAAAGAGAGAGCCGT 



GTGGCTGATCTCAGCCACCGCGAAAAT^A 
GGAATATAAATGTCAGGCTCCCTTATACA^^^ 

^a?aISa™gtgaaattgttcttgatgcagatgattttcagga 
Sa?gaataggtagatgtttttattttgtcacacaaaaaagaggc 

rTATTTCTTTTTATGATTTAATACGGCATTGAGGACAATAGCGAG 

"It?c?gS?gagaagaacatttggaaggctgtcgg 
-a^gaacatttggaaggctgtcggtcgactacaggtcactaatac 
tt^atagtgactggatatgttgtgttttacagtattatgtagtct 
???a!t?™5atattgatatttatatcattttacgtttctcgtt 

IS^rCCATTATAAAAAAGCATTGCTCATCAATTTGTTGCAACG 
ScAGG^CtS 



TAAC 




fart* 1** 
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GCCTTACGCATCTGTGCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATA 

AATACTACTCAGTAATAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAG 

AGTCTTTTACACCATTTGTCTCCACACCTCCGCTTACATCAACACCAATAACGCCATTTA 

ATCTAAGCGCATCACCAACATTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGC 

TTTCGGGGCTCTCTTGCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCAC 

CTGTCCCACCTGCTTCTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTG 

CACTGAGTAGTATGTTGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGA 

GGAACTCTTGGTATTCTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGT 

AATCATTGACCAGAGCCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGT 

ATTTCGGAGTGCCTGAACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAA 

TAACCGGGTCAATTGTTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCAT 

CGGAATCTAGAGCACATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATG 

GACCAGAACTACCTGTGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAA 

TCACGTATACTCACGTGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTC 

TTTTTTCGACCGAATTAATTCTTAATCGGCAAAAAAAGAAAAGCTCCGGATCAAGATTGT 

ACGTAAGGTGACAAGCTATTTTTCAATAAAGAATATCTTCCACTACTGCCATCTGGCGTC 

ATAACTGCAAAGTACACATATATTACGATGCTGTCTATTAAATGCTTCCTATATTATATA 

TATAGTAATGTCGTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAA 

GCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGG 

CATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCAC 

CGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTA 

ATGTCATGATAATAATGGTTTCTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTC 

GTATCTTTTAATGATGGAATAATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTT 

TGTATTTGGATTTTAGAAAGTAAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAA 

AAATAAACAAAGGTTTAAAAAATTTCAACAAAAAGCGTACTTTACATATATATTTATTAG 

ACAAGAAAAGCAGATTAAATAGATATACATTCGATTAACGATAAGTAAAATGTAAAATCA 

CAGGATTTTCGTGTGTGGTCTTCTACACAGACAAGATGAAACAATTCGGCATTAATACCT 

GAGAGCAGGAAGAGCAAGATAAAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTA 

CATCTTCGGAAAACAAAAACTATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAA 

TTTATATATTTATATTAAAAAATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAG 

GACCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAA 

ATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATAT 

TGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCG 

GCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAA 

GATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT 

GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGT 

GGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTAT 

TCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATG 

ACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTA 

CTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGAT 

CATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 

CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAA 

CTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA 

GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCC 

GGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGT 

ATCGTAGTTATCTACACGACGGGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATC 

GCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATAT 

ATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTT 

TTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGAC 

CCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC 

TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCA 

ACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTA 

GTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 

CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTG 

GACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGC- 



ACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCAT 

TGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 

GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGT 

CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGG 

CCGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGG 

CCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACC 

GCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTG 

AGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATT 

CATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCA 

ATTAATGTGAGTTACCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCT 

CCTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCAT 

GATTACGCCAAGCTCGGAATTAACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCC 

C C C CT CG AGATC CGGGAT CGAAGAAATGATGGT AAATGAAAT AGGAAATC AAGGAGC ATG 

AAGGCAAAAGACAAATATAAGGGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATG 

TATTTGGCTTTGCGGCGCCGAAAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCT 

GTGGCGGACCCGCGCTCTTGCCGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGC 

GGAGTTTTTTGCGCCTGCATTTTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGA 

AGCAATAAGAATGCCGGTTGGGGTTGCGATGATGACGACCACGACAACTGGTGTCATTAT 

TTAAGTTGCCGAAAGAACCTGAGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCA 

AGACTTGCGAGACGCGAGTTTGCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAG 

GTGAGACGCGCATAACCGCTAGAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCA 

GTATAAATAGACAGGTACATACAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAA 

TTCATTTGGGTGTGCACTTTATTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTA 

TGCACATATATTAATTAAAGTCCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGC 

TCTTTTCCGATTTTTTTCTAAACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGG 

TGTACAATATGGACTTCCTCTTTTCTGGCAACCAAACCCATACATCGGGATTCCTATAAT 

ACCTTCGTTGGTCTCCCTAACATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATA 

CCAGACAAGACATAATGGGCTAAACAAGACTACACCAATTACACTGCCTCATTGATGGTG 

GTACATAACGAACTAATACTGTAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTC 

ACTACCCTTTTTCCATTTGCCATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTC 

TTTTTTTTTCTTTTCTCTCTCCCCCGTTGTTGTCTCACCATATCCGCAATGACAAAAAAA 

ATGATGGAAGACACTAAAGGAAAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTG 

TTCCAGAGCTGATGAGGGGTATCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACG 

CACACTACTCTCTAATGAGCAACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAA 

TAAAAAAAGTTTGCCGCTTTGCTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTG 

TTTCCTCGTCATTGTTCTCGTTCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCA 

AGCTATACCAAGCATACAATCAACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCG 

AGCGGCGCCAATTTTAATCAAAGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTC 

ACTAACAGTAGCAACGGTCCGAACCTCATAACT^ACTCAAACAAATTCTCAAGCGCTTTCA 

CAACCAATTGCCTCCTCTAACGTTCATGATAACTTCATGAATAATGAAATCACGGCTAGT 

AAAATTGATGATGGTAATAATTCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCG 

TATAACGCGTTTGGAATCACTACAGGGATGTTTAATACCACTACAATGGATGATGTATAT 

AACTATCTATTCGATGATGAAGATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATC 

ACAAGTTTGTACAAAAAAGCAGGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGC 

CGCACGCGTACCCAGCTTTCTTGTACAAAGTGGTGACGTCGAGCTCCCTATAGTGAGTCG 

TATTACACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACACCGGTGAGCTCTAAGT 

AAGTAACGGCCGCCACCGCGGTGGAGCTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTC 

TCCAATCAAGGTTGTCGGCTTGTCTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGG 

TCAAATCGTTGGTAGATACGTTGTTGACACTTCTAAATAAGCGAATTTCTTATGATTTAT 

GATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTC 

TTAGGTTTTAAAACGAAAATTCTTGTTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCT 

TTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGAGCAA 

ATGCCTGCAAATCGCTCCCCATTTCACCCAATTGTAGATATGCTAACTCCAGCAATGAGT 

TGATGAATCTCGGTGTGTATTTTATGTCCTCAGAGGACAATACCTGTTGTAATCGTTCTT 

CCACACGGATCCGCATCAGGCGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTA 

AATATTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT 

AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCA 

CTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGC- 




CCACTACGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTA 
AATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTG 
GCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 
GTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGCGTCC 

CATTCGCCATTCACTGCA 
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GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTT 

CTTAGGACGGATCGCTTGCCTGTAACTTACACGCGCCTCGTATCTTTTAATGATGGAATA 

ATTTGGGAATTTACTCTGTGTTTATTTATTTTTATGTTTTGTATTTGGATTTTAGAAAGT 

AAATAAAGAAGGTAGAAGAGTTACGGAATGAAGAAAAAAAAATAAACAAAGGTTTAAAAA 

AT T T C AAC AAAAAG CGT ACT T T AC AT AT ATATTT ATT AGAC AAG AAAAG C AG ATT AAAT A 

GATATACATTCGATTAACGATAAGTAAAATGTAAAATCACAGGATTTTCGTGTGTGGTCT 

TCTACACAGACAAGATGAAACAATTCGGCATTAATACCTGAGAGCAGGAAGAGCAAGATA 

AAAGGTAGTATTTGTTGGCGATCCCCCTAGAGTCTTTTACATCTTCGGAAAACAAAAACT 

ATTTTTTCTTTAATTTCTTTTTTTACTTTCTATTTTTAATTTATATATTTATATTAAAAA 

ATTTAAATTATAATTATTTTTATAGCACGTGATGAAAAGGACCCAGGTGGCACTTTTCGG 

GGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCG 

CTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGT 

ATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTT 

GCT CAC C CAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTG 

GGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAA 

CGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATT 

GACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAG 

TACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGT 

GCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGA 

CCGAAGGAGCTAACCGCTTTTTTTCACAACATGGGGGATCATGTAACTCGCCTTGATCGT 

TGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTA 

GCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGG 

CAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCC 

CTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGT 

ATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACG 

GGCAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTG 

ATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAA 

CTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAA 

ATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGA 

TCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCG 

CTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACT 

GGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCAC 

CACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 

GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCG 

GATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGA 

ACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCC 

GAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACG 

AGGGAGCTTCCAGGGGGGAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTC 

TGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCCGAGCCTATGGAAAAACGCC 

AGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTT 

CCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACC 

GCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGC 

CCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGAC 

AGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACCTCACT 

CATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCCTATGTTGTGTGGAATTGTG 

AGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATT 

AACCCTCACTAAAGGGAACAAAAGCTGGGTACCGGGCCCCCCCTCGAGATCCGGGATCGA 

AGAAATGATGGTAAATGAAATAGGAAATCAAGGAGCATGAAGGCAAAAGACAAATATAAG 

GGTCGAACGAAAAATAAAGTGAAAAGTGTTGATATGATGTATTTGGCTTTGCGGCGCCGA 

AAAAACGAGTTTACGCAATTGCACAATCATGCTGACTCTGTGGCGGACCCGCGCTCTTGC 

CGGCCCGGCGATAACGCTGGGCGTGAGGCTGTGCCCGGCGGAGTTTTTTGCGCCTGCATT 

TTCCAAGGTTTACCCTGCGCTAAGGGGCGAGATTGGAGAAGCAATAAGAATGCCGGTTGG 

GGTTGCGATGATGACGACCACGACAACTGGTGTCATTATTTAAGTTGCCGAAAGAACCTG 

AGTGCATTTGCAACATGAGTATACTAGAAGAATGAGCCAAGACTTGCGAGACGCGAGTTT 

GCCGGTGGTGCGAACAATAGAGCGACCATGACCTTGAAGGTGAGACGCGCATAACCGCTA- 



GAGTACTTTGAAGAGGAAACAGCAATAGGGTTGCTACCAGTATAAATAGACAGGTACATA 

CAACACTGGAAATGGTTGTCTGTTTGAGTACGCTTTCAATTCATTTGGGTGTGCACTTTA 

TTATGTTACAATATGGAAGGGAACTTTACACTTCTCCTATGCACATATATTAATTAAAGT 

CCAATGCTAGTAGAGAAGGGGGGTAACACCCCTCCGCGCTCTTTTCCGATTTTTTTCTAA 

ACCGTGGAATATTTCGGATATCCTTTTGTTGTTTCCGGGTGTACAATATGGACTTCCTCT 

TTTCTGGCAACCAAACCCATACATCGGGATTCCTATAATACCTTCGTTGGTCTCCCTAAC 

ATGTAGGTGGCGGAGGGGAGATATACAATAGAACAGATACCAGACAAGACATAATGGGCT 

AAACAAGACTACACCAATTACACTGCCTCATTGATGGTGGTACATAACGAACTAATACTG 

TAGCCCTAGACTTGATAGCCATCATCATATCGAAGTTTCACTACCCTTTTTCCATTTGCC 

ATCTATTGAAGTAATAATAGGCGCATGCAACTTCTTTTCTTTTTTTTTCTTTTCTCTCTC 

C C C CGTTGTTGTC T CAC C AT AT C CG CAATGAC AAAAAAAATGATGGAAGAC ACT AAAGG A 

AAAAATTAACGACAAAGACAGCACCAACAGATGTCGTTGTTCCAGAGCTGATGAGGGGTA 

TCTTCGAACACACGAAACTTTTTCCTTCCTTCATTCACGCACACTACTCTCTAATGAGCA 

ACGGTATACGGCCTTCCTTCCAGTTACTTGAATTTGAAATAAAAAAAGTTTGCCGCTTTG 

CTATCAAGTATAAATAGACCTGCAATTATTAATCTTTTGTTTCCTCGTCATTGTTCTCGT 

TCCCTTTCTTCCTTGTTTCTTTTTCTGCACAATATTTCAAGCTATACCAAGCATACAATC 

AACTCCAAGCTTATGCCCAAGAAGAAGCGGAAGGTCTCGAGCGGCGCCAATTTTAATCAA 

AGTGGGAATATTGCTGATAGCTCATTGTCCTTCACTTTCACTAACAGTAGCAACGGTCCG 

AACCTCATAACAACTCAAACAAATTCTCAAGCGCTTTCACAACCAATTGCCTCCTCTAAC 

GTTCATGATAACTTCATGAATAATGAAATCACGGCTAGTAAAATTGATGATGGTAATAAT 

TCAAAACCACTGTCACCTGGTTGGACGGACCAAACTGCGTATAACGCGTTTGGAATCACT 

ACAGGGATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATTCGATGATGAA 

GATACCCCACCAAACCCAAAAAAAGAGGGTGGGTCGATCACAAGTTTGTACAAAAAAGCA 

GGCTTGTCGACCCCGGGAATTCAGATCTACTAGTGCGGCCGCACGCGTACCCAGCTTTCT 

TGTACAAAGTGGTGACGTCGAGCTCTAAGTAAGTAACGGCCGCCACCGCGGTGGAGCTTT 

GGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTCTACCTT 

GCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGTTGACAC 

TTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAA 

AATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTGTTCTT 

GAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTAT 

TGACCACACCTCTACCGGCATGCCGAGCAAATGCCTGCAAATCGCTCCCCATTTCACCCA 

ATTGTAGATATGCTAACTCCAGCAATGAGTTGATGAATCTCGGTGTGTATTTTATGTCCT 

C AGAGGAC AAT AC C TGTTGT AAT CGTT CTT C C AC ACGGAT C CCAATT CG C C CT AT AGTGA 

GTCGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGT 

TACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGA 

GGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGACGCGCCC 

TGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTT 

GCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCC 

GGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTA 

CGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCC 

TGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTG 

TTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATT 

TTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAAT 

TTTAACAAAATATTAACGTTTACAATTTCCTGATGCGGTATTTTCTCCTTACGCATCTGT 

GCGGTATTTCACACCGCAGGCAAGTGCACAAACAATACTTAAATAAATACTACTCAGTAA 

TAACCTATTTCTTAGCATTTTTGACGAAATTTGCTATTTTGTTAGAGTCTTTTACACCAT 

T TGT C T C CAC AC C T C C G C T T AC AT C AAC AC C AAT AAC G C C ATTT AAT C T AAG C G CAT CAC 

CAACA.TTTTCTGGCGTCAGTCCACCAGCTAACATAAAATGTAAGCTTTCGGGGCTCTCTT 

GCCTTCCAACCCAGTCAGAAATCGAGTTCCAATCCAAAAGTTCACCTGTCCCACCTGCTT 

CTGAATCAAACAAGGGAATAAACGAATGAGGTTTCTGTGAAGCTGCACTGAGTAGTATGT 

TGCAGTCTTTTGGAAATACGAGTCTTTTAATAACTGGCAAACCGAGGAACTCTTGGTATT 

CTTGCCACGACTCATCTCCATGCAGTTGGACGATATCAATGCCGTAATCATTGACCAGAG 

CCAAAACATCCTCCTTAGGTTGATTACGAAACACGCCAACCAAGTATTTCGGAGTGCCTG 

AACTATTTTTATATGCTTTTACAAGACTTGAAATTTTCCTTGCAATAACCGGGTCAATTG 

TTCTCTTTCTATTGGGCACACATATAATACCCAGCAAGTCAGCATCGGAATCTAGAGCAC 

ATTCTGCGGCCTCTGTGCTCTGCAAGCCGCAAACTTTCACCAATGGACCAGAACTACCTG 

TGAAATTAATAACAGACATACTCCAAGCTGCCTTTGTGTGCTTAATCACGTATACTCACG 

TGCTCAATAGTCACCAATGCCCTCCCTCTTGGCCCTCTCCTTTTCTTTTTTCGACCGAAT- 



